On Convergence Rates Equivalency and Sampling 
Strategies in Functional Deconvolution Models 



Marianna Pensky, 
Department of Mathematics, University of Central Florida, 
Orlando, FL 32816-1364, USA. 

Theofanis Sapatinas, 
Department of Mathematics and Statistics, University of Cyprus, 
P.O. Box 20537, CY 1678 Nicosia, Cyprus. 

Abstract 

Using the asymptotical minimax framework, we examine convergence rates equivalency 
between a continuous functional deconvolution model and its real-life discrete counterpart, over 
a wide range of Besov balls and for the L^-risk. For this purpose, all possible models are divided 
into three groups. For the models in the first group, which we call uniform,, the convergence 
rates in the discrete and the continuous models coincide no matter what the sampling scheme 
is chosen and, hence, the replacement of the discrete model by its continuous counterpart 
is legitimate. For the models in the second group, to which we refer as regular, one can 
point out the best sampling strategy in the discrete model, but not every sampling scheme 
leads to the same convergence rates: there are at least two sampling schemes which deliver 
different convergence rates in the discrete model (i.e., at least one of the discrete models leads 
to convergence rates that are different from the convergence rates in the continuous model). 
The third group consists of models for which, in general, it is impossible to devise the best 
sampling strategy; we call these models irregular. 

We formulate the conditions when each of these situations takes place. In the regular 
case, not only we point out the number and the selection of sampling points which deliver 
the fastest convergence rates in the discrete model but also investigate when, in the case of 
an arbitrary sampling scheme, the convergence rates in the continuous model coincide or do 
not coincide with the convergence rates in the discrete model. We also study what happens 
if one chooses a uniform, or a more general pseudo-uniform, sampling scheme which can be 
viewed as an intuitive replacement of the continuous model. Finally, as a representative of the 
irregular case, we study functional deconvolution with a box-car like blurring function since this 
model has a number of important applications. All theoretical results presented in the paper 
are illustrated by numerous examples many of which are motivated directly by a multitude 
of inverse problems in mathematical physics where one needs to recover initial or boundary 
conditions on the basis of observations from a noisy solution of a partial differential equation. 
The theoretical performance of the suggested estimator in the multichannel deconvolution model 
with a box-car like blurring function is also supplemented by a limited simulation study and 
compared to an estimator available in the current literature. The paper concludes that in both 
regular and irregular cases one should be extremely careful when replacing a discrete functional 
deconvolution model by its continuous counterpart. 

Keywords: adaptivity; Besov spaces; deconvolution; Fourier analysis; Meyer wavelets; 
minimax estimators; multichannel deconvolution; thresholding; wavelet analysis. 
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1 Introduction 



We consider the estimation problem of the unknown response function /(•) based on observations 
from the following noisy convolutions 

y{u,t)=f*g{u,t) + ^z{u,t), ueU,teT, (1.1) 



where C/ = [a, 6] , — oo < a < 6 < oo, and T = [0, 1] . Here, z{u, t) is assumed to be a two-dimensional 
Gaussian white noise, i.e., a generalized two-dimensional Gaussian field with covariance function 
K[z{ui,ti)z{u2,t2)] = S{ui — U2)S{ti — t2), where S{-) denotes the Dirac 5-function, and 

f*g{u,t)= / f{x)g{u,t - x)dx, 
Jt 

with the blurring (or kernel) function g(-, •) also assumed to be known. 



The model (1.1) has been recently introduced by Pensky & Sapatinas (2009) and can be 
viewed as a functional deconvolution model. If a = 6, it reduces to the standard deconvolution 
model which attracted attention of a number of researchers, e.g., Donoho (1995), Abramovich & 
Silverman (1998), Kahfa & Mallat (2003), Johnstone et.al. (2004), Donoho & Raimondo (2004), 
Johnstone & Raimondo (2004), Neelamani, Choi & Baraniuk (2004), Kerkyacharian et.al. (2007), 
Cavalier & Raimondo (2007) and Chesneau (2008). 

The functional deconvolution model ( |1.1[ ) can be viewed as a generalization of a multitude of 
inverse problems in mathematical physics where one needs to recover initial or boundary conditions 
on the basis of observations of a noisy solution of a partial differential equation. Lattes & Lions 
(1967) initiated research in the problem of recovering the initial condition for parabolic equations 
based on observations in a fixed-time strip, while this problem and the problem of recovering the 
boundary condition for elliptic equations based on observations in an internal domain were studied 
in Golubev & Khasminskii (1999); the latter problem was also discussed in Golubev (2004). These 
and other specific models in mathematical physics were discussed in detail in Pensky & Sapatinas 
(2009). 



However, model ( 1.1 ) is just an idealization of a real- life situation. One can make observations 
only at particular points (n/, tj), / = 1,2,..., M, i = 1, 2, . . . , A/', so that the actual problem can 
be formulated as follows: recover the unknown response function /(•) from observations y{ui,ti), 
where 



y{ui,ti) = J f{x)g{ui,ti - x)dx + £ii, ui £ U, ti = i/N, (1.2) 
with £ii being standard Gaussian random variables, independent for different / and i. Model (|1.2|) 



can be viewed as a discrete version of the continuous functional deconvolution model (1.1). 

It is well documented in the literature that asymptotic equivalence between discrete and 
continuous models holds in some nonparametric models. In particular. Brown & Low (1996) and 
Brown, Cai, Low & Zhang (2002) in the univariate case, and Reiss (2008) in the multivariate case, 
established, under some restrictions, asymptotic equivalence (in Le Cam sense) between nonpara- 
metric regression and Gaussian white noise models. Although, to the best of our knowledge, such 
an asymptotic equivalence between continuous and discrete models, in the functional deconvolu- 
tion setting, has not yet been explored, it has been documented in the literature a convergence 
rate equivalency, in the asymptotical minimax sense, between standard continuous and discrete 



deconvolution models, i.e., when a = b, M = 1 and A^ = n in (1.1) and (1.2), over a wide range 
of Besov balls and for the L''-risks, 1 < r < oo, e.g., Chesneau (2008), Pensky & Sapatinas (2009) 
and Petsa & Sapatinas (2009). 
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For the above reason, and using the asymptotical minimax framework, one may attempt to 
study the continuous functional deconvolution model ( 1.1 ) instead of its discrete counterpart ( 1.2 1, 
assuming that the convergence rates between these models coincide. However, in this case, this 
equivalence has only a limited scope. Indeed, Pensky &; Sapatinas (2009) only touched upon the 
issue, showing that, under very restrictive conditions, a convergence rate equivalence between the 
continuous functional deconvolution model (1.1) and its discrete counterpart (1.2) models holds 
when n = NM, over a wide range of Besov balls and for the L'^-risk. Nevertheless, in majority 
of practical situations, these conditions are violated and it remains to be seen how legitimate the 
replacement of the real life model (1.2) by its idealization (1.1) is, even in the case of inverse 
problems in mathematical physics, presented in Pensky & Sapatinas (2009). In fact, in many 
situations, the convergence rates in the two models depend on the choice of M and the selection 
of sampling points ui, U2, . . . , um and may coincide with the convergence rates in the continuous 
model for one selection and be different for another. Also, from a practical point of view, the 
objective is not to find M and ui,U2, ■ ■ ■ ,um which make the two models equivalent from the 
convergence rate viewpoint, but rather to point out M and ui,U2, ■ ■ ■ , um which deliver the fastest 
possible convergence rates in the real life model (1.2). Note that the discrete model (1.2) can also 
be viewed as a multichannel deconvolution model where the number of channels M = is fixed 
or, possibly, Mn ^ cxd as the sample size n — >■ oo; the case when M > 2 (finite) was considered 
in, e.g., Casey & Walnut (1994) and De Canditiis & Pensky (2004, 2006). Hence, if the kernel 
g(-, •) is fixed, the choice of M and the selection of sampling points ui,U2, ■ ■ ■ ,um which provide 
the fastest convergence rates is of extreme importance in signal processing. 

Using the asymptotical minimax framework, our objective is to evaluate how legitimate it is 
to replace the real-life discrete model (1.2) by its continuous counterpart (1.1). For this purpose, 
we shall divide all possible models into three groups. For the models in the first group, which 
we call uniform, the convergence rates in discrete and continuous functional deconvolution models 
coincide no matter what the sampling scheme is chosen and, hence, the replacement of the discrete 
model by its continuous counterpart is legitimate. For the models in the second group, to which 
we refer as regular, one can point out the best sampling strategy in the discrete model (i.e., the 
strategy which leads to the fastest convergence rate), but not every sampling scheme leads to the 
same convergence rates: there are at least two sampling schemes which deliver different convergence 
rates in the discrete model (i.e., at least one of the discrete models leads to convergence rates that 
are different from the convergence rates in the continuous functional deconvolution model). The 
third group consists of models for which, in general, it is impossible to devise the best sampling 
strategy; we call these models irregular. 

We formulate the conditions when each of these situations takes place. In the regular case, 
not only we point out the choice of M and the selection of sampling points ui,U2, ■ ■ ■ ,um which 
deliver the fastest convergence rates in the discrete model but also investigate when, in the case 
of an arbitrary sampling scheme, the convergence rates in the continuous functional deconvolution 
model coincide or do not coincide with the convergence rates of its discrete counterpart. We 
also study what happens if one chooses a uniform, or a more general pseudo-uniform, sampling 
scheme which can be viewed as an intuitive replacement of the continuous model. Finally, as a 
representative of the irregular case, we study functional deconvolution with a box-car like kernel 
since this model has a number of important applications. All theoretical results presented are 
illustrated by numerous examples many of which are motivated directly by a multitude of inverse 
problems in mathematical physics where one needs to recover initial or boundary conditions on 
the basis of observations from a noisy solution of a partial differential equation. 

As in Pensky & Sapatinas (2009), we consider functional deconvolution in a periodic setting, 
i.e., we assume that /(•) and, for fixed n G [/, g{u, •) are periodic functions with period on the 
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unit interval T. Note that the periodicity assumption appears naturahy in the above mentioned 
special models which (1.1) and (1.2) generalize, and allows one to explore ideas considered in the 
above cited papers to the proposed functional deconvolution framework. Moreover, not only for 
theoretical reasons but also for practical convenience (see Johnstone et.al. (2004), Sections 2.3, 
3.1-3.2), we use band-limited wavelet bases, and in particular the periodized Meyer wavelet basis 
for which fast algorithms exist (see Kolaczyk (1994) and Donoho & Raimondo (2004)). In order to 
also allow inhomogeneous functions /(•) into our study, we consider a wide range of Besov balls, as 
it is common in the wavelet literature and, for simplicity, we work with the L^-risk only. However, 
the results of this paper can be extended to a more general class of L''-risks, 1 < r < oo, using the 
unconditionality and Temlyakov properties of Meyer wavelets, e.g., Johnstone et.al. (2004) and 
Petsa & Sapatinas (2009). 

The rest of the paper is organized as follows. In Section [2] we describe the construction of 
wavelet estimators of /(•) derived by Pensky & Sapatinas (2009) both for the continuous functional 
deconvolution model (1.1 ) and its discrete counterpart ( |1.2[ ). In Section [sj for the continuous model 
and for a discrete model with any particular design of sampling points, using the asymptotical 
minimax framework, we provide lower bounds for the L^-risk over a wide range of Besov balls and 
show that those bounds are attained by the wavelet estimators constructed in Section|2} Section|4]is 
devoted to the discussion of the interplay between continuous and discrete functional deconvolution 
models. First, Section 4.1 formulates the necessary and sufficient conditions for the convergence 
rates in continuous and discrete functional deconvolution models to coincide and to be independent 
of the choice of M and the selection of points ui,U2, . . . , um- Then, Section [4. 2| provides examples 
where these conditions do or do not take place, and Section 4.3 sorts all possible situations into 
the uniform, regular and irregular cases. Section |5] studies the regular case. In particular. Section 



5.1 designs the best possible sampling strategy. Section 5.2 provides some motivating examples. 
Section 5.3 investigates the relation between the L'^-risks in the continuous and the discrete models 
under an arbitrary sampling scheme and formulates conditions when the convergence rates do or do 
not coincide. Section 5.4 formulates sufficient conditions when the convergence rates in both models 
coincide under a pseudo-uniform sampling scheme in the discrete model. Section 5.5 provides a 
variety of examples where the convergence rates coincide or differ depending on what sampling 
scheme is employed. Section |6] explores the interplay between continuous and discrete functional 
deconvolution models in the case of a box-car like blurring function. Section [7] supplements the 
theory with a limited simulation study in the case of a box-car like blurring function and compares 
performance of the suggested estimator to the estimator proposed by De Canditiis & Pensky (2006). 
Concluding remarks are given in Section |8] Finally, Section [9] (Appendix) provides the proofs of 
the theoretical results obtained in the previous sections. 

In the rest of the paper, the continuous functional deconvolution model (1.1) is referred to 
as the "continuous model" and its discrete counterpart (1.2) is referred to as the "discrete model". 



2 Wavelet estimators 



For both the continuous model and the discrete model, we use the wavelet estimator derived in 
Pensky & Sapatinas (2009), described as follows. 

Let (/'*(■) and be the Meyer scaling and mother wavelet functions, respectively, in the 
real line (see, e.g., Meyer (1992)) and obtain a periodized version of Meyer wavelet basis as in 
Johnstone et.al. (2004), i.e., for j > and /c = 0, 1, . . . , 2^' - 1, 

^jkix) = Y^2^l'^^\2\x + i) - k), i;jk{x) = J^2J'/2^*(2i(3. + i)-k), xe T. 
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Denote (/, 5) = Jrp f{t)g{t)dt, the inner product in the Hilbert space L?'{T). Let em{t) = 
gj27rmt^ m G Z, and, for any (primary resolution level) jo > and any j > Jq, let fmjok = 
{em,Vjok), tpmjk = {em,'4'jk), fm = {em J) be the Fourier coefficients of ^Pjk{-), ipjk{-) and /(•), 
respectively. For each u £ U, denote the functional Fourier coefficients by 

ym{u) = {em,y{u,-)), gm{u) = {em,g{u,-)). 

In what follows we assume that function g{u,t) is such that gm{u) are continuous functions of u 
for every m. (This condition is not restrictive and holds in all examples considered below.) 

If we have the continuous model (1.1), then, by using properties of the Fourier transform, 
for each u £ U, we have hm{u) = gmi'>J')fm and 

yrn{u) = gm{u)frn + n~'^/'^ Zm{u) , (2.1) 

where Zm{u) are generalized one- dimensional Gaussian processes such that K[zmi{ui)zm2{u2)] = 
Smi,m2^{ui — U2), where 5mi,m2 is Kronecker's delta. If we have the discrete model (1.2), then, by 
using properties of the discrete Fourier transform, for each / = 1,2,..., M, (2.1 ) takes the form 

ym{ui) = gm{ui)fm + N'^^'^Zml, (2.2) 

where z^i are standard Gaussian random variables, independent for different m and I. 
Estimate the Fourier coefficients fm of /(•) by 

fm = (^J gm{u)ym{u)diL^ I (^j \gm{u)\'^ du^ in the continuous model, (2-3) 

fm = i 9m{ui)ymiui) j/f ^ |5'm(^i«)P ) ^ t^e discrete model. (2.4) 
^1=1 ^ ^ 1=1 ^ 

Here, we adopt the convention that when a = b, fm takes the form fm = (5m(fl)ym(a)/|5'm(a)P 
and somewhat abuse notation using fm for both functional Fourier coefficients and their discrete 
counterparts. 

Note that, using the periodized Meyer wavelet basis described above and for any jo > 0, any 
(periodic) /(•) £ -^^(^) can be expanded as 

2^0-1 oo 2J-1 

it) + Y.Yl b-Jk^Jk{t), t G T. (2.5) 

k=0 j=jo k=0 

Furthermore, by Plancherel's formula, the scaling coefficients, aj^k = ifi^jok): and the wavelet 
coefficients, bjk = {f,iJjk), of /(•) can be represented as 

^jok — ^ ^ fm'^mjoki bjk — ^ ^ fm'4^m,jkj (^-S) 

where Cj^ = {m : (pmjok / 0} and, for any j > Jq, Cj = {m : ipmjk / 0}. We estimate aj^k and bjk 
by substituting fm in (|2.6l) with (|2.3l) or (|2.4|), i.e.. 



^jok — ^ ^ fm'^mjoki bjk — ^ ^ fm'4^m,jk' (^•'^) 
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We now construct a (block thresholding) wavelet estimator of /(•), suggested by Pen- 
sky & Sapatinas (2009). For this purpose, we divide the wavelet coefficients at each resolu- 
tion level into blocks of length Inn. Let Aj and Ujr be the following sets of indices Aj = 
{r I r = 1,2, . . . ,2Vlnn} , [/,> = {k \ k = 0,1, . . . ,2^ - 1; {r - l)lnn < k < rlnn - l} . Denote 

Bjr = Yl ^Ifc' = Yl ^fc- (2.8) 

Finally, for any jo > 0, /(•) is constructed as 

2^0^! J-l 

fn{t) = Y ^JoWjokit) + bjkK\Bjr\ > Xj) i^jkit), t G T, (2.9) 

A:=0 i=3o reAj k£Ujr 

where 1(^4) is the indicator function of the set A, and the resolution levels jo and J and the 



thresholds Aj will be defined in Section 3.2 



In what follows, the symbol C is used for a generic positive constant, independent of n, while 
the symbol K is used for a generic positive constant, independent of m, n, M and ui,U2, ■ ■ ■ ,um, 
which either of them may take different values at different places. 

3 Minimax lower and upper bounds for the L^-risk over Besov 
balls 

Among the various characterizations of Besov spaces for periodic functions defined on LP(T) in 
terms of wavelet bases, we recall that for an r-regular multiresolution analysis with < s < r 
and for a Besov ball, B^g{A) = {/(•) G LP{T) : / e B^^, < A}, of radius A> with 

1 < P, 9 < cxD, one has that, with s' = s + 1/2 — 1/p, 

( /2J0-1 \l / oo .2^-1 ) 

with respective sum(s) replaced by maximum if p = oo or (7 = cxd (see, e.g., Johnstone et.al. (2004), 
Section 2.4). (Note that, for the Meyer wavelet basis, considered in Section [2| r = 00.) 

We construct below asymptotical minimax lower bounds for the L^-risk based on observa- 
tions from either the continuous model or the discrete model. For this purpose, we define the 
corresponding minimax L^-risks over the set Q as 

/?^(l^)=inf supE||/^-/f, (3.2) 

Riin,u,M) = inf supE||/„^ - ff, (3.3) 
/en 

Ri{n) = M Riin,u,M), (3.4) 

u,M 



where the infimum in (3.2) is taken over all possib le es timators (i.e., measurable functions) /^(•) 
of /(•) from the continuous model, the infimum in (3.4) is taken over all possible estimators /^(O 



of /(•) from the discrete model, based on a sample at M points u = {ui,U2 ■ ■ ■ ,um), and the 
infimum in (3.4) is evaluated over all possible estimators /^(•) of /(•) and the choices of M and u. 



Note that, since the asymptotical minimax convergence rates for the L^-risk in the discrete model 
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depends on M and u if these quantities are fixed, we are interested in the selection of M and n, 
minimizing the asymptotical minimax convergence rates for the L^-risk. 

Denote s* = s + 1/2 — \/p\ p' = min(p, 2), and, for k = 1, 2, define 



/ \gm{u)\''^du and Ti{m,u,M) = —Y,\g^{ui)\''\ (3.5) 



where rf(m) = \gm{o)\'^ when a = b. 

Pensky & Sapatinas (2009) constructed asymptotical minimax lower and upper bounds for 
the L'^-risk for the continuous model. The corresponding bounds for the discrete model were 
obtained under the very restrictive conditions that the upper and the lower bounds on r{'(m, u, M) 
do not depend on n, M and u. Below we shall need asymptotic lower and upper bounds for the 
L^-risk in the case of much more general expressions for rf(m) and Tf{m,u,M), than in Pensky 
& Sapatinas (2009). 

3.1 Minimax lower bounds: particular choice of sampling points 

Let, with some abuse of notation, ri(m) = Tf(m), R^{Bpg{A)) = R^{Bpg{A)), in the continuous 
model, and Ti(m) = Tf{m,u,M), Rl{B^g{A)) = R'}^{B^glA),u,M), in the discrete model. 

Assume that for some constants z^gM, AgM, a>0 and /3 > 0, independent of m and n, 
and for some sequence e„ > 0, independent of m. 



Ti{m) < Ksn \m\ ''(ln|m|) exp ( 



a\m\ 



i/>0 if a = 0. 



Denote n* = nSn and assume that the sequence Sn is such that 



n 



ner 



oo as n ^ oo. 



(3.6) 



(3.7) 



Then, the following statement is true. 



Theorem 1 Let {</'jo,fc(')) V'j,fc(")} periodic Meyer wavelet basis discussed in Section 



K Let 



s > max(0, 1/p — 1/2), 1 < p < oo, 1 < < oo and ^ > 0. Let assumptions (3.6) and (3.7) hold. 
Then, as n ^ oo, 



Rl{Bl,{A)) > < 



C(n*) 2S+2U+1 (lnn*)2s+2i^+i , if q = 0, v(2 - p) < ps* , 

2s* X 



C 



Inn* \ '2.s*+2v 
n* 



(lnn*)2s*+2>', if Q = 0, v{2-p)>ps*, 
if a > 0. 



(3.8) 



C(lnn*)"~ 

3.2 Minimax upper bounds: particular choice of sampling points 

Assume that for the constants i/GM, AgM, a>0 and /3 > and the sequence > in (3.6) 

Ti(m) > Ken\m\''^''{h\\m\)~^ 



exp 



am 



i/>0 if a = 0. 



(3.9) 



Let /n(-) be the wavelet estimator defined by (2.9). Let, as before, n* = n£n satisfy condition 
(3.7), and assume that in the case of q = in (3.9) the sequence e„ is such that 



/ii Inn < ln(l/e„) < (1 — /12) Inn 



(3.10) 
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for some constants /ii > and /12 G (0,1). Observe that condition (3.10) implies (3.7) and that 
Inn* X Inra as n ^ 00. Here, and in what follows, u(n) x v{n) means that there exist constants 
Ci > and C2 > 0, independent of n, such that < Civ{n) < u{n) < C2v{n) < cxd for n large 
enough. 

Choose jo and J such that 



2Jo=ln(n*), 2'^ = (n*)^, if a = 0, 



2J0 



ln(n*^ 
vr V 2a 



2^°, if a>0. 



(3.11) 
(3.12) 



(Since jo > J —1 when a > 0, the estimator ( |2.9[ ) only consists of the first (linear) part and, hence, 
Xj does not need to be selected in this case.) Set, for some constant /x > 0, large enough. 



Xj = ^2 (n*)-i ln(n*) 22^^/, if a = 0. 



(3.13) 



Note that the choices of jo, J and Xj are independent of the parameters, s, p, q and A of the Besov 
ball Bp q{A); hence, the estimator (2.9) is adaptive with respect to these parameters. 
Set {x)-^- = max(0,a;), and define 



p(2s+2u+l) ' '^V^ P) < pS , 

M±, if K2-P)=P5*, 

0, if u{2 -p) > ps*. 



(3.14) 



For any j > jo, let \Cj\ be the cardinality of the set Cj; note that, for Meyer wavelets, 
\Cj\ = A-k2^ (see, e.g., Johnstone et.al. (2004)). Let also 



A.(i) = |C,|-' ^ T«(m) [ri(m)]-2-, ^ = 1,2. 



(3.15) 



Direct calculations yield that under conditions (3.9) and (3.10), for some constants ci > and 
C2 > 0, independent of n, one has 



Ai(i) < <^ 



ci (e„)-i 22-^j\ 
^ C2 (£„)-! 22-J j^ exp { 



a 



■Svr 



if a = 0, 
, if a > 0. 



(3.16) 



The proof of the minimax upper bounds for the L^-risk is based on the following two lemmas. 



Lemma 1 Let assumption (3.9) hold, and let the estimators a j^^k o -nd bjk of the scaling and wavelet 
coefficients aj^k and bjk, respectively, be given by the formula (2.1) with fm defined by (2.S) in the 
continuous model and by (2.4) in the discrete model. Then, for all j > jo. 



E|ojofc - a^ofcp < Cn ^Ai(jo), IE|6jfc - bjk\'^ < Cn ^Ai(i). 



If a = and assumption (3.10) holds, then, for any j > jo, one has 

^bjk - bjkl'^ < Cn{lnnf^ (n*)~^. 



(3.17) 



(3.18) 
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Lemma 2 Let the estimators bjk of the wavelet coefficients bjk be given by the formula (2.7) with 
fm defined by (2.3) in the continuous model and by (2.4) in the discrete model. Let assumptions 



(3.9) (ifa = 0) and (3.10) hold. If 



where c\ and /i2 are defined in (3.16) and (3. jO^ , respectively, then, for all j > jo. 



(3.19) 



^ \bjk - bjk\'^ > (4n 



*^-l ^2 



Then, the following statement is true. 



(3.20) 



Theorem 2 Let fn{-) be the wavelet estimator defined by (2.9), with j'o and J given by (3.11) (if 
a = or (3.12) (if a > 0) and fj, satisfying (3.19). Let s > l/p', 1 < p < oo, I < q < oo and 
A > 0. Then, under the assumptions (3.9) and {3.1) if a > 0, or pTM) and (3.10) if a = 0, as 



n oo, 



sup E\\fn-fr<{ 



C{n*r 



c 



2s+2i/+l 
2s* 



(lnn)^+ 

[ C(ln(n*))-f , 



2s+2^+i , if a = 0, iy{2 -p) < ps* 

2s* X 

if a = 0, z^(2 — p) > ps* 

if a > 0. 



2s*+2i/ 



(3.21) 



Remark 1 Note that in the continuous model, one can write a lower bound for ri(m) in (3.6) and 
an upper bound for Ti(m) in (3.9) with e„ = 1, so that n* = n in (3.8) and (3.21 ). However, in the 



discrete model this may not be possible. Theorems [T] and [2] allow to account for the dependence 
of ri(m) on n in the case of the discrete model as well as for an extra logarithmic factor in the 
expression of Ti(m) which often appears in the case of the continuous model. 

Remark 2 Note that Theorems [T] and [2] can be applied even if the values of z/, A, a and /? in 
assumptions (3.6) and (3.9) are different, that may also depend on M and u. Then, Theorem 
[l] provides asymptotical minimax lower bounds for the L^-risk while Theorem [2] provides the 
corresponding upper bounds. If, in the continuous model or in the discrete model with some 
particular choice of M an d sampling points u, the values of z^, A, a and /? and the functions in 
conditions (3.6) and (3.9) coincide, then Theorems [l] and [2] imply that the estimator /n(-) defined 
by (2.9) is asymptotically optimal (in the minimax sense), or near-optimal within a logarithmic 



factor, over a wide range of Besov balls Bp g(A). Therefore, in the rest of the paper, when we talk 
about convergence rates we refer to the asymptotical minimax lower bounds for the L^-risk which 
are attainable, up to at most a logarithmic factor, according to Theorems [T] and [2} 



4 The interplay between continuous and discrete models: uni- 
form, regular and irregular cases 

The convergence rates in the discrete model depend on two aspects: the total number of obser- 
vations n = NM and the behavior of Tf{m,u,M). In the continuous model, the values of Tf(m) 



are fixed; they depend on m only and, hence, conditions (3.6) and (3.9) can be easily verified. 
However, this is no longer true in the discrete model; in this case, the values of Tf{m,u,M) may 
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depend on the choice of M and the selection of points u. If we require the values of Tf{m,u, M) 
to be independent of the choice of M and the selection of points u, then the convergence rates 
in the discrete and the continuous models coincide and are independent of the selection of points 



u. Moreover, in this case, the wavelet estimator (2.9) is asymptotically optimal (in the minimax 



sense) no matter what the choice of M is. It is quite possible, however, that in the discrete model. 



conditions (3.6) and (3.9) both hold but with different values of i^. A, a and j3 for different choices 
of M and u. In this case, the asymptotical minimax upper bounds for the L^-risk in the discrete 
model may not coincide with the convergence rates in the continuous model, at least for some 
sampling schemes. 

4.1 Necessary and sufficient conditions for convergence rates equivalency be- 
tween continuous and discrete models 

Assume that there exist points u^:,u* E [a, b], independent of m, such that, for any u G [a, b], 

\9miu)\ < K\gm{u*)\ and \gmiu)\ > K\gmiu^)\- (4.1) 

In this case, {b - a)K^\gm{u^)\^ < rf (m) < {b - a)K'^\gm{u*)\'^ and K'^\gmM\'^ <Tf{m,u,M) < 
K'^\9m{u*)\'^ for any M and u. Note that, based on the assumption on the blurring function g{-,-) 
made in Section [2] points and u* satisfying condition (4.1) always exist; however, they are not 
necessarily independent of m. 

Here, and in what follows, Um ^ means that there exist constants Ci > and C2 > 0, 
independent of m, such that < CiVm < Um < C2fm < 00 for |m| large enough. 

The following statement, which substantially extends Proposition 1 of Pensky & Sapatinas 
(2009), presents the necessary and sufficient conditions for the convergence rates in the discrete 
model to be independent of the choice of M and the selection of points u and, hence, to coincide 
with the convergence rates in the continuous model. 

Theorem 3 Let there exist constants ui G M, 1^2 £ IK, oi > 0, 02 > 0, /3i > and (52 > 0, 
independent of m and n, such that 

|9m(u*)|^ X |mr2''iexp(-ai|m|'^i), z^i > if ai = 0, (4.2) 
\gm{u^)\^ ■^\m\-'^''^e^v{-ci2\mf^), U2 > if a2 = 0. (4.3) 

Then, the convergence rates obtained in Theorems^^and^in the discrete model are independent 
of the choice of M and the selection of points u, and, hence, coincide with the convergence rates 
obtained in Theorems\^and^in the continuous model, if and only if 

a\a2 > and /3i = f32 or ai = 02 = and i^i = 1^2- (4-4) 

Remark 3 Theorem |3] provides necessary and sufficient conditions for the convergence rates in 
the continuous and the discrete models to coincide, and to be independent of the choice of M and 
the selection of points u. These conditions also guarantee asymptotical optimality (in the minimax 



sense) of the wavelet estimator (2.9), and can be viewed as some kind of uniformity conditions. 
Under assumptions (4.2)-(4.4), asymptotically (up to a constant factor) it makes absolutely no 
difference whether one samples the discrete model n times at one point, say, ui or, say, y/n times 
at M = ^/n points ui, 1 = 1,2,..., M. In other words, each sample value y{ui,ti), I = 1,2, ... , M, 
i = 1,2, . . . , N , asymptotically (up to a constant factor) gives the same amount of information 
and, therefore, the convergence rates are not sensitive to the choice of M and the selection of 



10 



points u. On the other hand, if the conditions of Theorem [3] are violated, then the convergence 
rates in the discrete model depends on the choice of M and u, and some recommendations on their 
selection should be given. Furthermore, optimality (in the minimax sense) issues become much 
more complex when Tf{m; u, M) is not uniformly bounded from above or below. 

4.2 Some illustrative examples 

Theorem |3] provides necessary and sufficient conditions for the continuous and the discrete models 
to be equivalent, from the viewpoint of convergence rates, no matter what the choice of M and 
the selection of points u are. The difficulty, however, is that many models do not satisfy those 
conditions. Below, we consider some illustrative examples that have recently been studied in 
Pensky & Sapatinas (2009). 

Example 1 Estimation of the initial condition in the heat conductivity equation. Let 

/i(t, x) be a solution of the heat conductivity equation 

dh(t,x) d'^h(t,x) r 1 r ,1 

' = ' ^e[0,l], t^[aM, a>0, 6<oo, 

with initial condition /i(0,x) = f{x) and periodic boundary conditions /i(t,0) = /i(t, 1) and 
dh{t,x) / dx\x=o = dh{t^x) / dx\x=i- 

We assume that a noisy solution y{t,x) = h{t,x) + n~^/'^z{t,x) is observed, where z{t,x) 
is a generalized two-dimensional Gaussian field with covariance function K[z{ti,xi)z{t2,X2)] = 
6{ti — t2)5{xi — X2), and the goal is to recover the initial condition /(•) on the basis of observations 
y{t,x). This problem was initially considered by Lattes & Lions (1967) and further studied by 
Golubev & Khasminskii (1999). 

Then, the functional Fourier coefficients gm{') are of the form gm{u) = exp(— 47r^m^ti), so 
that = b, u* = a, \gm{u*)\ = exp(— 47r^5m^) and \gm{u*)\ = exp(— 47r^am^) (see Example 1 in 
Pensky & Sapatinas (2009)). Hence, Theorem [s] holds with vi = 1J2 = 0, ai = An'^b, 02 = 47r^a and 
f3i = (32 = 2. Therefore, the convergence rates in the continuous and the discrete models coincide, 
and are independent of the choice of M and the selection of points u. 

Example 2 Estimation of the boundary condition for the Dirichlet problem of the 
Laplacian on the unit circle. Let /i(x, w) be a solution of the Dirichlet problem of the Laplacian 
on a region D on the plane 

d'^h(x,w) d'^h(x,w) , ^ ^ ^9 

with a boundary dD and boundary condition /i(x,t(;)|g^ = F{x,w). Consider the situation when 
D is the unit circle. Then, it is advantageous to rewrite the function h{-, •) in polar coordinates as 
h{x,w) = h(u,t), where u E [0, 1] is the polar radius and t £ [0,27r] is the polar angle. Then, the 
boundary condition can be presented as h{l,t) = f{t), and h{u, •) and /(•) are periodic functions 
of t with period 2it. 

Suppose that only a noisy version y{u,t) = h{u,t) + n~^^'^z{u,t) is observed, where z{u,t) 
is as in Example 1, and that observations are available only on the interior of the unit circle with 
u G [0,ro], ro < 1, i.e., o = 0, b = < 1. The goal is to recover the boundary condition /(•) on 
the basis of observations y(u,t). This problem was initially investigated in Golubev & Khasminskii 
(1999) and Golubev (2004). 
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(4.5) 



Then, the functional Fourier coefficients gm{') are of the form 

\gmiu)\ = Ku^"'^ = Kexp{-\m\ln{l/u)) u£[0,ro], (4.6) 

so that u=K = 0, n* = rg, \gmiu^:)\ = and \gmiu*)\ = K exp{—\m\ ln(l/ro)) (see Example 2 in 
Pensky & Sapatinas (2009)). Hence, the conditions of Theorem [s] do not hold, and we cannot 
be certain that the convergence rates in the continuous and the discrete models coincide for any 
sampling scheme. Actually, it is easy to see that if sampling is carried out entirely at the single 
point = 0, then r{^(m,'U*, 1) = and we cannot recover the boundary condition /(•). 

Example 3 Estimation of the speed of a wave on a finite interval. Let h(t, x) be a solution 
of the wave equation 

d'^h{t,x) _ d'^h{t,x) 
dt"^ dx'^ 

with initial-boundary conditions h{0,x) = 0, dh{t,x)/dt\^^^ = f{x) and h{t,0) = h{t, 1) = 0. 

Here, /(•) is a function defined on the unit interval [0, 1], and the goal is to recover the speed 
of a wave /(•) on the basis of observing a noisy solution y{t,x) = h{t,x) + n~^/'^z{t,x), where 
z{t, x) is as in Example 1, with t £ [a,b], a > 0, b < 1. 

Then, the functional Fourier coefficients gm{') are of the form 

go{u) = 1 and gm{u) = {2TT'm)^^ sin(27rmu), m G Z \ {0}, u £ [a,b], (4-7) 

(see Example 4 in Pensky &: Sapatinas (2009)). It is easy to see that in order to satisfy the condition 
( |4.1[ ) the points and u* should depend on m, and, hence, the convergence rates depend on the 
selection of Af and u. Hence, the convergence rates in the continuous and the discrete models may 
coincide for one selection of M and u and be different for another. Actually, it is easy to see that 
if M = 1 and u is an integer, then Tf{m, u, 1) = and we cannot recover the speed of a wave /(•). 



4.3 Possible cases 



Theorem |3] in Section 4.1 provides necessary and sufficient conditions for the convergence rates in 



the discrete model to be independent of the choice of M and the selection of points u and, hence, 
to coincide with the convergence rates in the continuous model. We can divide these conditions 
into the following two groups. 

Condition I. There exist constants z/i G M, ai > and (3i > and a point u* G [a, b], independent 
of m and n, such that 

|5m(n)|^ < K|(7m(n*)P X |m|-2^iexp(-ai|m|^i), i/i > if ai = 0. (4.8) 

Condition I*. There exist constants 1^2 G M., a2 > and f32 > 0, and a point £ [<i,b], 
independent of m and n, such that 

bm(n)|2 > K|(7^(n,)|2 X |mr2^2 exp(-a2|m|^2), 1/2 > if 02 = 0. (4.9) 

Condition II. Either aia2 > and /3i = /32 or ai = 02 = and vi = V2- 
Consider now the following three cases. 

1. The uniform case: Conditions I, I* and II hold. 
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2. The regular case: Condition I holds but Condition II does not hold. Condition I* holds 
or, possibly, \gm{u^)\ =0. 

3. The irregular case: Condition I does not hold. 



It is easy to see that Examples 1, 2 and 3 of Section 4.2 correspond to the uniform case, the 
regular case and the irregular case, respectively. 

Theorem |3] shows that in the uniform case, the convergence rates obtained in Theorems [T] 
and [2] in the discrete model are independent of the choice of M and the selection of points u, and, 
hence, coincide with the convergence rates obtained in Theorems [T] and |2] in the continuous model. 
In the uniform case one can replace the discrete model by the continuous model, no matter what 
M and u are. 

In the regular case, one cannot guarantee that the convergence rates between continuous 
and discrete models coincide. However, as we shall show below, one can still locate a point u* 
which delivers the best possible convergence rates. If sampling is done entirely at this point, then 
the discrete model can sometimes deliver better convergence rates than the continuous model. 
Nevertheless, if another sampling strategy is chosen, then the convergence rates in the discrete 
model may be worse than in the continuous model. Note that we do not require Condition I* 
to hold. This is due to the fact that Condition I* refers to the "worst case scenario" when we 
sample at the points which leads to the highest possible variance and, consequently, to the lowest 
convergence rates. One can also view |gm(M*) | = as an extreme case of Condition I* when 1^2 = oo 
or 02 = cti and P2 = 00. It is easy to see that if, in the discrete model, all sampling is carried out 
at u^: , then the convergence rates will be worse than in the case of sampling entirely at u* or than 
in the continuous model. Hence, in the regular case, sampling strategy does matter. 

In the irregular case, it is impossible to pinpoint the best sampling strategy which suits any 
problem; this is due to the fact that Condition I can be violated in a variety of ways. For this 
reason, we study a particular example of the irregular case, namely, functional deconvolution with 
a box-car like blurring function; this important model occurs in the problem of estimation of the 



speed of a wave on a finite interval (see Example 3 in Section 4.2 ) and, a discretized version of it, 
in many areas of signal and image processing which include, for instance, LIDAR (Light Detection 
and Ranging) remote sensing and reconstruction of blurred images (see Section [6| . 

5 The regular case 
5.1 The best discrete rates 

It is easy to see that, in the regular case, r^(m, u*,l) > KT^{m). Hence, it follows from Theorems 
[1] and [2] that, if the discrete model is sampled entirely at u* (i.e., M = 1 and ui = u*), then the 
asymptotical minimax lower and upper bounds for the L^-risk in the discrete model can be only 
lower than the respective lower and upper bounds in the continuous model. 



Denote by /^(•) the wavelet estimator of /(•) defined by (2.9) based on observations from the 
continuous model, and let /^(O = /^(n, M, •) be the corresponding wavelet estimator of /(•) based 
on observations from the discrete model evaluated at the point u. Denote fn*{') = fn{'^*^ 1' ')• 

Then, the following statements is true. 



Theorem 4 Let {</'jo,fc(')' V'i,fc(')} periodic Meyer wavelet basis discussed in Section^and 

assume that s > max(0, 1/p — 1/2) (for the lower bounds) or s > 1/p' (for the upper bounds), 
1 < p < 00, 1 < q < 00 and A > 0. Then, 

R^^{B;^^{A))>CRi{B;^^{A),u\l)^Ri{B;^g{A)). (5.1) 
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Also, for any choice of M and u, we have 



sup nft-ff<C sup nt-ff, (5.2) 



sup E||/f-/r<C sup nfn-fW- (5.3) 

Theorem |4] confirms that samphng entirely at the single point u* leads to the highest possible 
convergence rates in the discrete model. However, it does not provide an answer to the question 



whether the inequalities in (5.1) and (5.2) are strict or the convergence rates are the same in the 
continuous and the discrete models with sampling entirely at the single point u* . To get a better 
insight into the matter, let us consider a few more examples. 

5.2 More examples 

Example 2 (continuation). In the case of estimation of the boundary condition for the Dirichlet 
problem of the Laplacian on the unit circle, the functional Fourier coefficients gm{') are of the form 



(4.6) with ro < 1. Hence, u* = rg and Tf{m, u* ,1) x \gm{u*)\'^ x exp(— |m| ln(l/ro)). On the other 
hand, T^{m) x v?\"^\du = Tq '"'^^/(2|m| + 1) x |m|~^ exp(— |m| ln(l/ro)). Hence, by Theorems 
[T] and [2] the convergence rates in the continuous model coincide with the convergence rates in the 
discrete model if sampling is carried out entirely at the single point u* . 

Example 4 Let the functional Fourier coefficients gm{-) satisfy 

|5m(u.)|^ X |m-r^", 0<a<u<6<oo. 

Then, in the continuous model, rf(m) = j^\gm{u)\'^du J^exp(— 2'uln |m|)(in x |m|~^"(ln |m|)~^, 



implying that conditions ( |3.6| ) and (3.9) hold with = o, a = and A = 1. In the case of the 
discrete model, u* = a and Tf{m, u*, 1) x Igmiu*)]"^ x |m|~^° and conditions (3.6) and (3.9) hold 



with 1^ = u* , a = and A = 0. Hence, by Theorems [T] and |2] the convergence rates in the 
continuous model are worse than the convergence rates in the discrete (they differ by a logarithmic 
factor) model when sampling is carried out entirely at the single point u* . 

Example 5 Let the functional Fourier coefficients gm{') satisfy 

|5'm(^i)|^ ^< exp (-a|m|") , 0<a<ti<6<oo, (5.4) 

for some constant a > 0, independent of m. Then u* = a and Tf{m,u*,l) = |(7m('U*)P s< 
exp (—a|m|'^) . On the other hand, r{^(m) x exp (— a|m|") dn x (ln|?Ti|)~^ I\m\<i -^"^ exp(—az)dz, 
so that rf (m) > K |m-|~^(ln |m|)~"'^ I\m\'^ exp{—az)dz x |m|~^(ln |m|)~^ exp(— a|m|") and (m) < 
|m|~"(ln |m|)~-^ jj^l^, exp(— az)(i2: x |m|~"(ln |m|)~-^ exp(— a|m|"). 

Hence, by Theorems [T] and |2] the convergence rates in the continuous and the discrete models 
coincide if sampling is carried out entirely at the single point u* . 

Example 6 Let the functional Fourier coefficients gm{') satisfy 

\gm{u)\'^ ^ \m\~'^'^ exp{—u\m\^), < u < 6 < oo, (5.5) 

for some constants > and /? > 0, independent of m. Then, u* = and 

Tf{m,u\l)>:\gm{u*)\^>i\m\-^r (5.6) 
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On the other hand, it is easy to check that 

rf(m) X \m\-^'' / ex.p{-u\mf)du x \m\-'^^''+^\ (5.7) 
Jo 

Hence, by Theorems [T] and [2] the convergence rates in the continuous model are worse than in the 
discrete model when sampling is carried out entirely at the single point u* = 0. 



5.3 Conditions for convergence rates equivalency and non-equivalency between 
continuous and discrete models 

We shall say that the convergence rates in the continuous and the discrete models "almost coin- 
cide" if the convergence rates coincide up to, at most, a logarithmic factor when the convergence 
rates are polynomial = 0) or up to, at most, a constant when the convergence rates are 

logarithmic {a{u) > 0). We choose this distinction between the cases of polynomial and logarith- 
mic convergence rates since in the polynomial case the upper bounds for the risks of the adaptive 
estimator may differ from the corresponding lower bounds for the risk by a logarithmic factor. 

Hence, a question naturally arises: under which conditions on the choice of M and the 
selection of sampling points u do the convergence rates in the discrete and the continuous models 
almost coincide and under which conditions this does not happen. In order to answer this question, 
first we have to derive upper and lower bounds for the L^-risk in the continuous model. 

In what follows we assume that the functional Fourier coefficients Qmi') satisfy the assump- 
tion 

X |mr2'^(")exp(-Q(n)|m|'^(")), u e U, (5.8) 

for some continuous functions i^(-), a(-) and /?(•) defined on u € U, such that either a{u) = and 
u{u) > or a{u) > and (3{u) > 0, for all u £ U. Denote 



2s 



k{2s + 2v{u*) + 1) 
2s* 



if z^(n*)(2 -p) < ps*, 

(5.9) 

if u{u*){2-p) >ps*. 



I, k{2s* +2u{u*)) 
Then, the following statement is valid. 

Lemma 3 Let {<Pjo,k{-),'4'j,k{-)} be the periodic Meyer wavelet basis discussed in Section^ and 
assume that s > max(0, 1/p — 1/2) (for the lower bounds) or s > 1/p' (for the upper bounds), 
1 < p < oo, 1 < q < oo and ^ > 0. Let also the functional Fourier coefficients gm{') satisfy 



assumption (5.8). Denote 



argmiuugf/ i^(n), if a{u)=0, 
argmiuugf/ /3(n), if a{u) > 0, /3(n) 7^ const. 

Assume further that, in the neighborhood of point u = u* , the function (3{-) is continuously differen- 
tiable (if a{u) > 0, u £ U) or the function zv(-) is k-times continuously differentiable (if a{u) = 0, 
u £ U), where k > 1 is such that 

zy(^)(n*) = 0, s = l,...,k-l, i/W(-u*)^0, (5.10) 
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with denoting the s-th derivative of the function Then, the asymptotical minimax lower 

and upper bounds for the Lp'-risk in the continuous model are as follows: 



K{Bl^{A)) > I 



Cn 2''+2-("*)+i(lnn)'', if a{u) = ^, u{u*){2 - p) < ps* 

2s* 

C(^^y''^""-^'\lnny, if a{u) = 0, i^{u*){2 - p) > ps* 



(5.11) 



sup E\\f^-fr<{ 



C(lnn) if a{u) > 0, 

Cn"2s+2J'(l*)+i(inn)^+'^, if a{u) = 0, iy{u*){2 - p) < ps* , 
C7(i^)'^*+'"''"*'(lnn)''+'', if a{u)=0, i^{u*){2-p)>ps*, (5.12) 



C(lnn) 



if a{u) > 0, 



then k = oo in (5.10) and ?? = 0. 



Here p is given by (3.14) with v = v{u'), and "d is given by (5.9). If is a constant function, 



Remark 4 In Lemma |3] we do not consider the case when /?(«) is constant since this situation 
belongs to the uniform case and the convergence rates in the continuous and the discrete models 
coincide for any sampling scheme due to Theorem |3j Note also that the value of u* in Lemma [3] 
is always independent of m and easy to find. 

The utility of Lemma |3]is that it allows one to formulate conditions such that the convergence 
rates in the continuous model almost coincide with the convergence rates in the discrete model for 
any particular choice of a sampling scheme. 



Theorem 5 Let assumptions (5.8) and (5.10) hold. 



(i) If a{u) = 0, then the convergence rates in the continuous and the discrete models coincide 
up to at most a logarithmic factor if M = Mn and u are such that 



Tf{m,u,Mn) > i^e„|m|-2''("*)(ln|m|)-^i 



(5.13) 



for some constant Ai G M, independent of m and n, and for some sequence £n > 0, independent of 
m, satisfying 

lim e„(lnn)^2 > q (5.14) 

n— >oo 

for some constant A2 > 0. If, moreover. En, M = Mn and u are such that opposite inequalities 
hold, i.e.. 



Tf{m,u,Mn) < Ce„|m|~2''("*)(ln|m|)"^i and lim e„(lnn)^2 < cx), 



(5.15) 



for the same constants Ai and A2 as in formulae (5. IS) and (5.14), and if k in (5.10) is such that 



k{Xi + A2) = 1, then the convergence rates in the continuous and discrete models coincide up to 
constant. 

(a) If a{u) > 0, then the convergence rates in the continuous and discrete models coincide 
up to constant if M = Mn and u are such that 



Tf{m,u,Mn) > Ken\m\-'^'' exj) (^-a|m|^(''*)) (ln|m|)"^i 



(5.16) 



for some constants 1/ G M, Ai G M and a > 0, independent of m and n, and for some sequence 



En > 0, independent of m, satisfying condition (3.10). 
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Theorem |5] provides sufficient conditions for a sampling sclieme in the discrete model to 
lead to the convergence rates which are optimal or near-optimal. It follows from conditions (5.8 1 
and (5.10) and Theorems [T] and |2] that, if the discrete model is sampled entirely at u*, then the 
convergence rates in the continuous and the discrete models almost coincide. Namely, as n — > oo. 



( 2s 

(Jj^ 2s+2i/{u*) + l ^ 



C(lnn) 



if a{u)=0, v{u*){2 - p) < ps* , 
if a{u) = 0, 
if a(u) > 0, 



C(%^ if a{u) v{u*){2-p)>ps\ 

2s* 



(5.17) 



and 



sup nfn-ff<{ 



Cn 2s+2Kn*)+i (inn)^, 
_ C7(lnn) 



(«*) , 



if a(ti) = 0, v{u*){2 - p) < ps* , 

if a{u) = 0, u{u*){2-p) > ps*, (5-18) 

if a(u) > 0. 



From the above, it also follows that 



1, if a(n) > and /?(«) > 0, u e U, 

(Inn)'', if a(n) = 0, u e U, 



and, hence, the convergence rates in the discrete model cannot be better than the convergence 
rates in the continuous model if a(n) > and cannot be better by more than a logarithmic factor 
if a{u) = 0. 

We shall say that the convergence rates in the discrete model with sampling at M points u 
are "inferior" than the convergence rates in the continuous model if the convergence rates differ 
by more than a logarithmic factor for a{u) = or by more than a constant factor if a(n) > 0. The 
following statement shows when this happens. 



Theorem 6 Let assumptions (3.7), (5.8) and (5.10) hold and let lim„_»oo lii(e„)/ In n = eg < oo? 
for some sequence £n > 0, independent of m. 



(i) Let a(u) = and let assumption (3.10) hold. If M = Mn and u are such that 

Tf{m,u,Mn) < K£n\m\~'^''{ln\m\)~^ 



(5.19) 



for some constants A € M and ly > 0, independent of m and n, then the convergence rates in the 
discrete model are inferior to the convergence rates in the continuous model if 



V > ^{u*) and eq < 



2{u-i^{u*))/{2s + 2i^{u*) + l), if u{2-p)<ps* 
2{u -u{u*))/{2s* +2iy{u*)), if u{2 - p) > ps* 



or, 



V = v{u*) and lim e„ (Inn)'^ = for any a > 0. 

n^oo 

(ii) Let a{u) > and M = Mn and u he such that 



n\'ni\ 



-2u . 



exp 



Q|m|^ ) (In \m\y 



(5.20) 
(5.21) 

(5.22) 



for some constants G M, A G M and a > 0, independent of m and n. Then, the convergence rates 
in the discrete model are inferior to the convergence rates in the continuous model if 



j3 > f3{u*) and Eq > —1, or (3 = (3{u*) and Eq 



-1. 



(5.23) 
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Theorems formulate conditions in terms of Tf{m,u,Mn)- The following corollaries 

contain more specific results for various sampling schemes. 

Corollary 1 Let M = Mn be finite. Then, the necessary and sufficient condition for the conver- 
gence rates in the continuous and the discrete models to almost coincide is that for at least one I, 
I = 1,2, . . . ,M, one has v{ui) = v{u*) if a{u) = or I3{ui) = (3{u*) if a{u) > 0. 

Corollary 2 If a(u ) = and M = Mn < C(lnn)-^* for some constant X* £ [0,oo), then the 
convergence rates in the continuous and the discrete models almost coincide if one has y{ui) = v{u*) 
for at least one I, I = 1,2, ... , M. 

Corollary 3 If a{u) > and M = Mn < Cn'^ for some constant t S [0, 1), then the convergence 
rates in the continuous and the discrete models almost coincide if one has (3{ui) = f3{u*) for at 
least one I, I = 1,2, ... , M. 



5.4 Pseudo-uniform sampling strategies 



Theorems [5] and [6] and Corollaries [T] |2] and [3] in the Section 5.3 establish, in the case of an arbitrary 



sampling scheme, when the convergence rates in the continuous model almost coincide with the 
convergence rates in the discrete model, or when the convergence rates in the discrete model are 
inferior. 

However, when the discrete model is replaced by the continuous model, the underlying 
implicit assumption is that sampling is carried out at M = M„ equidistant points with M„ oo. 
In particular, the interval [a, h] is partitioned into M equal subintervals of the length A = {b—a)/M 
and ui = A(/ + d), I = 0, 1, . . . , M — 1, where d S [0, 1] is the parameter which allows one to 
accommodate various sampling techniques (e.g., d = Q, d=loTd = 1/2, respectively, when 
sampling is carried out at the left, the right and the middle of each sub- interval) . 

Below, we study an extension of this sampling scheme. We avoid treating ui,U2, . . ■ , um as 
a random sample since this is not the case in both mathematical physics and signal processing 
applications. Instead, in order to accommodate various sampling strategies, we consider a contin- 
uously differentiable function S{x), x G [0, 1], such that < si < S'{x) < S2 < oo and 5(0) = a, 
5(1) = b. Let d G [0, 1] and let 

ui = s(^-^^j^y l = l,2,...,M. (5.24) 

Denote the inverse of 5(n) by q{u) = 5^^(n), u G [a, 6], and observe that q{u) is continuously 
differentiable in [a,b] with < l/s2 < q'{u) < 1/si < oo. Many functions 5(-) satisfy these 
conditions, e.g., 5(x) = a + {b — a)x^, where < /i < oo (the case h = 1 corresponds to the 
uniform sampling). 



Theorem 7 Let assumptions l[5.3\ ) and (5.10) hold and let ui, I = 1,2, ... ,M , be defined by (5.24) 
where the function S{x), x G [0, 1], is continuously differentiable such that < si < S'{x) < S2 < 
oo and 5(0) = a, 5(1) = b. Then, the convergence rates in the discrete and the continuous models 
almost coincide if, for M = Mn, 

a{u) = and lim M~"'^lnn = ri < oo, or a{u) > and lim M~^lnlnn = T2 < oo. (5.25) 

n— »oo >oo 

If, moreover, \gmiu)\'^ = K\m\~'^'^^'^^ for some continuously differentiable function I'^u), u £ U, 
and also 

lim (lnn)^+^/^ = 0, 
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where k is defined in (5.10), then the convergence rates in the discrete and the continuous models 
coincide up to a constant. 



Remark 5 Note that if a{u) > in (5.8) and d in ( |5.24[ ) is such that f3{ui) = I3{u*) for some 
/, / = 1,2,... ,M, then a combination of Theorems [5] and [7] yields that the convergence rates 
in the discrete and the continuous models coincide for any value of M = Note also that, 

although conditions (5.251 in Theorem [7] are sufficient for the convergence rates in the discrete and 
the continuous models to almost coincide, examples in the next section demonstrate that these 
conditions are also necessary or close to being necessary: if the conditions in (5.25 ), or some slightly 
weaker conditions, are violated, then the convergence rates in the discrete model are inferior to 
the convergence rates in the continuous model. 



5.5 Examples revisited 



Example [2] (continuation). Recall that \gm{u)\'^ s< exp(— 2 ln(l/u)|m|), u G [0,ro], so that 
P = 1 and a{u) = 21n(l/ti). Hence, u* = ro and if the discrete model is sampled entirely at the 
single point u* , then the convergence rates in the continuous and the discrete models are given by 
formulae (5.11) and (5.12) or (5.17) and (5.18), respectively, and they coincide. 



However, the convergence rates in the discrete and the continuous models coincide under 
much weaker conditions. In fact, if M„ = 0{n'^) for some constant r G [0, 1) and ui = ri > 



for at least one /, I = 1,2,...,M, then rf(m,u,M) > n"^ exp(-2 ln(l/ri) 



\m\ 



and, by 



Theorem |5] the convergence rates in the discrete and the continuous models coincide. On the 
other hand, if ui = • • • = um-i = 0, um = ri > and M = Mn x n/lnn, then Tf{m,u,M) x 
n~^ln?7- exp(— 21n(l/ri)|m|) and, by Theorem [6| the convergence rates in the discrete model are 
inferior to the convergence rates in the continuous model. 

Now, consider the case of the pseudo-uniform sampling ui = £'((/ — l+d) /M), I = 1,2,..., M, 
with d £ [0, 1] and a function S{x), x £ [0, 1], satisfying the assumptions of Section 5.4 We will 



show that the convergence rates in the discrete and the continuous models coincide no matter what 



.,2|m| ^ 2|m| 



1=1^1 







On the other 



-1 + d) /M) - 5(0) >si{l-l + d) /M, 



{M~^ si{l - I + d)) 



2\m\ 



Here, 



the value of M is. To verify this, note that Tf{m,u, M) = 
hand, it is easy to see that since S{{1 — 1 + d)/M) = S{(1 

one has Tf{m,u,M) > M"! T.tM > ELm/2+i 
si < 1, due to 5(0) = 0, 5(1) = tq < 1 and < si < 5'(x), and, therefore, Tf{m,u,M) > 

EfiAf/2+i(0-5'Si)^'''"' = 0.5 exp(-2|m|log(2/si)) . Since ln(2/si) > 0, the convergence rates 
in the discrete and the continuous models coincide due to Theorems [T] and O 

We conclude this example by a rather obvious observation. Reducing the sampling interval 
from [0,ro] to [ri,ro], with ri > 0, yields = ri and Theorem [3] immediately becomes valid. For 
this reason, although |5m(^)| does not satisfy condition (5.8) (since |(7m(0)| = 0), the convergence 
rates in the continuous and the discrete models coincide for the majority of "reasonable" sampling 
schemes. Since, with the restriction < ri < n, the problem of the estimation of the boundary 
condition for the Dirichlet problem of the Laplacian on the unit circle simply reduces to the uni- 
form case, we can consider the problem as an example of an "almost uniform" case and conclude 
that replacing the discrete model by the continuous model is a legitimate choice. 



Example [4] (continuation). Recall that \gm{u)\'^ |m|~^", u G [a, 6], so that a{u) = 0, 
v{u) = 2ii, k = 1 and u* = a. If M = M„ = O ((Inn)'^ ) for some constant A* > and ui = a for 
at least one 1,1 = 1,2,... ,M, then, by Corollaries [T] and [2] the convergence rates in the discrete 
and the continuous models almost coincide. On the other hand, if ui = a but ui > a + d, d > 0, 
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for I = 2,3, ... , M, and M = M„ is such that hm^^oo M„(lnn)~'^ = oo for any constant A* > 0, 
then the convergence rates in the discrete model are inferior to those in the continuous modeL 

To verify this, note that under the assumptions above T{^(m, u, M) < K (M~^|m|~^" + |m|~'^^°"'"'^^) < 
Kmax (M~^|m|~-^", |m|~^('^"'"'^)) . Now, apply Theorem jsj first with e„ = and u = ^{u*) and 
then with = 1 and = I'^u*) + d. 

Now, consider the case of the pseudo-uniform sampling ui = S{{l — l+d) /M), 1 = 1,2,..., M, 



with d S [0,1] and a function S{x), x E [0,1], satisfying the assumptions of Section 5.4 By 
Theorem [7j the convergence rates in the discrete and the continuous models coincide up to, at 
most, a logarithmic factor if M = M„ is such that lim„^oo In n < oo. If, moreover, |g'm('u)P = 
K\m\~'^^ and M = is such that lim^^oo (Inn)^ = 0, then the convergence rates coincide 
up to, at most, a constant. In other words, in each case, the convergence rates in the discrete and 
the continuous models almost coincide. 

Let us show that the opposite is also true: if d > and M = Mn is such that 

lim M^^(lnlnn)"^lnn = oo, (5.26) 

n— >oo 

then the convergence rates in the discrete model are inferior to those in the continuous model. 
For this purpose, note that m - a = S{d/M) - 5(0) > sid/M = di/M, so that Tf{m,u,M) < 
iiT Iml"^^""'"'^^/*^^ = K\m\~'^'^ exp(— 2(ii In |m|/M). Since In |m| x Inn in this case, we have Tj^(m, u, M) < 
K En \m\~'^"' with e„ = exp(— 2di In \m\/M). Now, the fact that the convergence rates in the discrete 
model are inferior to those in the continuous model follows from Theorem [6] and the observation 



that condition (5.26) implies condition (5.21). The latter shows that the sufficient conditions of 
Theorem [7] are very close to being also necessary conditions in this case. 

Example [H] (continuation). Recall that \gm{u)\'^ exp (— Q|m|"), 0<a<u<b< 
oo, so that a{u) = a > and u* = a. Note that, by Corollary [s] if M = M„ is such that 
Mn < Cn'^ for some constant r G [0, 1) and ui = a for at least one I, I = 1,2,..., M, then the 
convergence rates in the discrete and the continuous models almost coincide. However, 'd ui = a 
and ui > a + d for 1 = 2,3,..., M, and M = Mn is such that M„ x n/lnn, then the convergence 
rates in the discrete model are inferior to those in the continuous model. To show this, note that 
Tf{m,u,M) <K [exp (— a|m|'*+'^) + Innexp (— a|m|'^)] x In nexp (— a|m|") since, in this 
case, \m\ x (Inn)^^"" and, thus, exp (— a|m|°'''''^) = o (n~^ Innexp (— a|m|")) as n ^ oo. Hence, 
application of Theorem [6] with e„ = Inn/n yields that the convergence rates in the discrete model 
are inferior to the convergence rates in the continuous model. 

Now, consider the case of pseudo- uniform sampling. By Theorem [7j the convergence rates in 
the discrete and the continuous models coincide if M = Mn is such that lim^— >oo In Inn < oo. 
Moreover, by Remark [5| the convergence rates in the discrete and the continuous models coincide 



whenever d = in formula (5.24), no matter what the value of M is. 



Let us show that, if d > 0, then the second condition in (5.25) is necessary in order the 
convergence rates in the discrete and the continuous models to coincide up to at most a constant. 
For this purpose, we assume that M = Mn is such that lim^^oo -^rT^ In Inn = oo and prove 
that the convergence rates in the discrete model are inferior to the rates in the continuous model. 
For this purpose, observe that ui > a + d/M for every /, I = 1,2, .. . ,M, so that Tf{m,u,M) < 

(din I m I \ 
—a\m\'^e m j . Now, recalling that, in this case, ln|7n| >c Inlnn and Inn* >c Inn, and 

repeating the proof of Theorem [T] with e„ = 1, we obtain that, for every n, in both the sparse and 
the dense cases as n ^ oo, 

Rn{B;^g{A),u,Mn) >Cilnn)-^^m. 
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Hence, the convergence rates in the discrete case are inferior to those in the continuous model 
whenever 

^ 2s*d 

lim (In n) ''+<i/Mn ^ a = g-^p ( — — In In n 



a{aMn + d) 



oo, 



which is true if limn 



M„ In In n = oo and d > 0. 



Example [g] (continuation). Recall that \gm{u) 



m 



-2u 



exp(— M|m|^), u G [0,6], and 

that conditions of Lemma [s] do not hold since a{u) = u > and a(0) = 0. We show that, in this 
example, the convergence rates in the discrete and the continuous models do not coincide. Recall 
that n* = and, due to formulae (5.6) and (5.7), Theorem [T] implies that, as n — > oo. 



Cn 2s+2^+/3+l, 



> 



c 



Inn \ 2s*+2i^+/3 



if 1/(2 -p) < ps* 
if 1/(2 — p) > ps* 



and 



Ri{B;^^{A)^Ri{B;^^{A),u\l)> 



_ 2s 

Cn 2.+2^+i, 



C 



2a' 

Inn \ 2s*+2i^ 



if z/(2 — p) < ps* , 
if z/(2 -p) > ps*, 



(5.27) 



(5.28) 



i.e., the convergence rates, in both discrete and continuous models, are polynomial. However, if 
one samples the model at > d, Z = 1, 2, . . . , M, then Tf{m,u, M) < C |m|~^'^ exp(— d|m|'^) and 
the convergence rates in the discrete model are logarithmic, i.e., as n ^ oo. 



Ri{B;^g{A),u,M)>C{lnny 



23" 





(5.29) 



Now, consider the pseudo-uniform sampling strategy ui = S{{1 — 1 + d)/M), 1 = 1,2,..., M, 
with a continuous differentiable function S{x), x G [0,1], such that 5(0) = 0, 5(1) = b and 

< si < S'{x) < S2 < oo. Since si((/ -l + d)/M) < S{{1 - 1 + d)/M) < S2{{1 - 1 + d)/M), 

1 = 1,2,..., M, one obtains, by direct calculations, that 



|^l-2!/g-S2(i|nt|'3/M 



M (l 



-S2 



\m\0/M^ 



< Tf{m,u,M) < 



^^^-2u^-sid\mf/M 

M {l- e~*il™l''/^) ' 



(5.30) 



Therefore, for M = M^, the convergence rates in the discrete model depend on the value of d and 
the asymptotic behavior of \m\^ /Mn- Let us now show that by choosing different values of d and 
M„, one can obtain each of the three convergence rates ( 5.27| )-( "5.29| ). 

If M„ is large (e.g., M„ > CnV(2;/+/3+i))^ ^j^^^^ |m|^/M„ ^ as n ^ oo, then 1 - 



-Si\mf/M„ 



m\^/Mn 



1,2. Therefore, T^{m,u, Mn) 



m 



-{2u+i3) g^j^j hence, the conver- 



gence rates in the discrete and the continuous models coincide and are given by (5.27). 



If M„ is small (e.g., M„ = O(lnn)), so that \m\" /Mn ^ oo as n — > oo, then (5.30) takes the 



form 



KM- 



-2u, 



-S2d\m\^/M„ < rf(m,u,M„ 



If M = Mn is finite and d > 0, then, by Theorems [T] and [2] the convergence rates in the discrete 
model are logarithmic, they are given by the right hand side of formula (5.29) and are inferior to 
the convergence rates in the continuous model. 

Finally, if M = is finite and d = 0, then the convergence rates in the discrete model 
are provided by the right hand side of formula (5.28) and are superior to those in the continuous 
model. For moderate values of Mn, one can obtain convergence rates in between (5.28) and (5.29). 
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6 Irregular case: a box-car like blurring function 



Suppose that the blurring function g{-,-) in the continuous model is of a box-car like, i.e., 

g{u, t) = 0.5 7(m) I{\t\ <u), ueU, teT, (6.1) 
where 7(-) is some positive function. In this case, functional Fourier coefficients gm{') satisfy 

go{u) = 1 and (7m(n) = (27rm,)^^ 7(n) sin(27rmn), mGZ\{0}, u G [a,b]. (6.2) 
It is easy to see that estimation of the initial speed of a wave on a finite interval (see Example 3 



in Section 4.2) leads to gm{-) of the form (6.2) with ^{u) = 1 (see (4.7)). 
Assume that 

71 < 7(^^) < 72 ,ue[a,b], (6.3) 

for some < 71 < 72 < cxd. (Obviously, this is true if 7(-) is a continuous function.) Under ( |6.3[ ), 
it is easily seen that 

rf(m) X m"^ (6.4) 

implying that conditions ( |3.6[ ) and ( |3.9| ) hold with u = 1 and a = 0. Consequently, in this case, 
using the results of Theorems [T] and [2] we can obtain the corresponding asymptotical minimax 
lower and upper bounds for the L^-risk. 

Consider now the discrete model. Recall from Section [T] that this model can be viewed 
as a discretization of the continuous model or as a multichannel deconvolution problem with M 
channels where n = NM denotes the total number of observations and, possibly, M = Mn 00 
as n — > 00. Note that multichannel deconvolution with box-car kernels (i.e., 7(u) = 1/u, for some 
fixed u > 0) is the common problem in many areas of signal and image processing which include, 
for instance, LIDAR remote sensing and reconstruction of blurred images. LIDAR is a lazer device 
which emits pulses, reflections of which are gathered by a telescope aligned with the lazer, see, e.g.. 
Park, Dho & Kong (1997) and Harsdorf & Renter (2000). The return signal is used to determine 
distance and the position of the reflecting material. However, if the system response function 
of the LIDAR is longer than the time resolution interval, then the measured LIDAR signal is 
blurred and the effective accuracy of the LIDAR decreases. This loss of precision can be corrected 
by deconvolution. In practice, measured LIDAR signals are corrupted by additional noise which 
renders direct deconvolution impossible. Moreover, if M > 2 (finite) LIDAR devices are used to 
recover a signal, then we talk about a multichannel deconvolution problem, leading to the discrete 



model described by (1.2). 



For any choice of M and selection of points u, under (6.3), we easily see that 



M ^ 47r2m2 M 

1=1 1=1 



It follows from (6.5 ) that for any choice of M and any selection of points u, we have 

Tf{m;u,M) < Km-^. (6.6) 

Hence, in this case, by Theorem [l] the asymptotical minimax lower bounds for the L^-risk in 
this discrete model cannot be lower than the asymptotical minimax lower bounds for the L^-risk 
obtained in the continuous model. 

However, it is impossible to find a point u* £ [a, 6], independent of m, such that, for any 
u G [a,,b], one has sin^(27rmti) < K sm'^{2TT'mu*); in other words, in this case. Condition I does 
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not hold and we deal with the irregular case here. It turns out that in the case of a box-car type 
kernel, sampling at any one point is not at all the best strategy. Indeed, Johnstone and Raimondo 
(2004) showed that in the case of standard deconvolution (M = 1, 7(n) = 1/u, u = u* = a = b), 
the degree of ill-posedness is v = 3/2. The latter means that the asymptotical minimax lower 
bounds for the L^-risk is given by Theorem [l] with a = and u = 3/2. Johnstone and Raimondo 
(2004) also demonstrated that if n* = a is selected to be a 'Badly Approximable' (BA) irrational 
number, then these lower bounds can be attained over a wide range of ellipsiods using a nonlinear 
blockwise estimator in the sequence space domain. 

The convergence rates obtained above can be improved by sampling at several different 
points. De Canditiis & Pensky (2006) studied the multichannel deconvolution problem with the 
box-car blurring function and derived that if M is finite, M > 2, one of the ui,U2, ■ ■ ■ , um is a BA 



irrational number, and n is a BA irrational tuple, then in formula (3.16) 



Ai(j) < C(M)i2^(2+i/^^) (6.7) 

(for the definitions of the BA irrational number and the BA irrational tuple, see, e.g., Schmidt 
(1980, p. 42) and also Section |8|. This implies that in this case, the degree of ill-posedness is at 
most u < \ + 1/(2M), meaning that if M > 1, then v is less than 3/2 (that corresponds to the 
case of sampling at a single BA irrational number). Furthermore, De Canditiis &: Pensky (2006) 
showed that the asymptotical upper bounds for the error (for the L''-risk, 1 < r < oo, and for a 
fixed response function /(•)) depend on M: the larger the value of M is the higher the asymptotical 
convergence rates will be. Hence, in the multichannel box-car deconvolution problem, it seems to 
be advantageous to take M = M„ — > oo as n ^ oo, and to choose u to be a BA irrational tuple. 
However, the theoretical results obtained De Canditiis &: Pensky (2006) cannot be blindly applied 
to accommodate the case when M = M„ — > oo as n ^ oo; this generalization requires, possibly, 
non-trivial results in number theory (see the discussion in Section [s]) . 



On the other hand, if conditions (6.1 ) and (6.3 ) hold and M = M„ oo fast enough as n 



oo, then it is not needed to employ BA irrational tuples, as we reveal below. If M = M„ — ^ cc fast 
enough as n ^ oo, then deconvolution with a box-car like blurring function in the discrete model 
can provide estimators with the same convergence rates as in the continuous model. The following 
statement shows that, if M = — > oo fast enough as n — > oo, then an appropriate selection of 



points u, can secure asymptotic relation similar to (6.4), thus, ensuring equal convergence rates in 
both the discrete and the continuous models. 



Lemma 4 Consider g{-, •) to be of the form (6.1) with 7(-) satisfying (6.3), and letO < a < b < oo. 
Let m € Aj, where \Aj\ = c2^ , for some c > 0, with (Inn)*^ < 2^ < vF^ , j > jo, for some 5 > 
and jo > 0. Take ui = a + {b - a)l/M, / = 1, 2, . . . , M. If M > Mon = (327r/3)(6 - a)n^/^ then, 
for n and \m\ large enough, 



T^{m;u,M) > Km~\ (6.8) 

Note that Lemma |4] can be applied if M = M„ > cqh^^^ for some constant cq > 0, inde- 
pendent of n. Let A = min(3co/(327r), 6 — a). Set M = Mn, ui = a + lA/M and observe that 
ui € [a, b] for / = 1,2,..., M. Then, the following statements is valid. 

Theorem 8 Let {4>jo,k{'):'>Pj,k{')} be the periodic Meyer wavelet basis discussed in Section\^ Con- 



sider g{-, •) to be of the form Jg. 1^ with satisfying (6.3), and let < a < b < oo. Let 
K{B;,,{A)) to be either RI{B;^^{A)) or Ri{B;^^{A)) . 
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(Lower hounds) Let s > max(0, 1/p — 1/2), 1 < p < oo, 1 < q < oo and ^ > 0. Then, as 
n — > oo, 

Cn"^, if s > 3(l/p- 1/2), 

l_ (6.9) 
C{'-^]''^\ if s<3(l/p-l/2). 



K{b;ja)) > 



(Upper bounds) Let s > 1/p', 1 < p < oo, 1 < g < oo and ^ > 0. Set v = \ and assume that 
M = Mn > cnn ^^^ for some constant cq > 0, independent of n. Let fni') be the wavelet estimator 
defined by (2.9), with jq and J given by (3.11), and let f^{u,M, •) be the wavelet estimator defined 
by (2.9), evaluated at the poi nts ui = a + 1/S./M, / = 1, 2, . . . , M, where A = min(3co/(327r), h — a) 
and jo and J are given by (3.11). Let also /^(•) be either fn{-) or f^{u,M,-). Let s > 1/p', 
1 < p < oo, 1 < < oo and A > 0. Then, as n ^ oo. 



sup E||/°-/||2< 



Cn"^ (lnn)^ if s > 3(l/p - 1/2), 



C 



Inn \ s'+l 
n 



(lnn)^ if s<3(l/p-l/2). 



(6.10) 



w/iere £> = 3(2/p-l)+/(2s + 3) if s > 3{l/p - 1/2), Q = {l-p/q)+ if s = 3{l/p - 1/2) and g = 
if s<3{l/p- 1/2). 



7 A limited simulation study 

Here, we present a limited simulation study in the multichannel deconvolution model with a box- 
car like blurring function. We assess the performance of the suggested block thresholding wavelet 



estimator (BT) given by (2.9), with equispaced selected points ui = l/M, I = 1,2, . . . , M , and 
compare it to the term-by-term thresholding wavelet estimator (TT) proposed by De Canditiis & 
Pensky (2006), where the points ui, I = 1,2,..., M, were selected such that one of the u^'s is a 
BA irrational number, and ui,U2, ■ . ■ ,um is a BA irrational tuple (see Section 4 in De Canditiis 
& Pensky (2006)). 

Specifically, we assume that we observe 

y{ui,ti) = / f{x)g{ui,ti-x)dx + aieii, uieU = [0,1], ti = i/N, (7.11) 
Jt 

where g{ui,t) = (2ni)^^I(|t| < ui), ui £ U = [0,1], and en are standard Gaussian random 
variables, independent for different / and i. For simplicity, we assume that af = o"^ for all / = 
1,2,...,M. 

The suggested algorithm consists of the following steps: 
1. For each M = 4,8,16, generate M different equispaced sequences, yu (= y{ui,i/N)), I = 



l,2,...,M,i = l,2,...,N, following model (7.11). 



2. Generate functions 5r(ni,-), y{ui, ■) (pjoki') and Vifc(-)> J = io,io+l, • • • k = 0,l,.. . ,2^- 
1, at the same equispaced points, ti = i/N , i = 1,2, . . . , N. 

3. Apply the discrete Fourier transform (FFT) to gi, yi, (j)jf^k and ipjk, j = jo, jo + 1, . . . , J — 1, 
k = 0,l,...,2^ -1. 



4. Estimate aj^k and bjk by, respectively, aj^^k and bjk, given by (2.7). 

5. Compute Bjr = Ekeu^r ^k- 
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Figure 1: AMSE for the Bumps, Blip, Heavisine and Step functions sampled at a fixed number of 
A'^ = 128 points, based on RSNR=1, as the number of channels M (and hence the sample size n) 
increases. Solid line: BT wavelet estimator; Dash line: TT wavelet estimator. 



6. Compute the threshold Xj = a'^d*n ^ Inn Ai(j), j > jo, where n = NM, d* = 1. 



M N-1 



\\ M{N - 2) EE 



i=2 



yii-i 



'^yii ^ yi,i+i 






Ai(j) 



I a- 



m 



m£Ci 



(see Remark 6 in Pensky & Sapatinas (2009) and Miiller & Stadmiiller (1987)). 
7. Threshold the wavelet coefficients belonging to blocks with \Bjr\ < Xj. 



8. Apply the inverse wavelet transform to obtain fn{') given by (2.9). 



We used the test functions "Bumps", "Blip", "Heavisine" and "Step", and set jo = 3. For 
a fixed value of the (root) signal-to-noise ratio (RSNR=1), we generated S = 100 samples of size 
n = NM from model (7.11) in order to calculate the average mean-squared error (AMSE) given 

by 

S N IN 

E Y.^fn{U) - f{U)f j;/2(t.), U = i/N. 

m=l i=l I i=l 

In Figure [!} for a fixed number of data points N = 2"^ , we evaluate the AMSE as the number 
of channels M, and hence the sample size n, increases for the four signals mentioned above. 
Obviously, both BT and TT wavelet estimators improve their performances, as n increases, and 
the BT wavelet estimator appears to have smaller AMSE than the TT wavelet estimator in all 
cases. 

Although not reported here, we also evaluated precision of the suggested BT wavelet esti- 
mator for a wide variety of other test functions (see the list of test functions in Appendix I of 
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Antoniadis, Bigot & Sapatinas (2001)) and RSNR's, with very good performances. This numerical 
study confirms that under the multichannel deconvolution model with a box-car like blurring func- 
tion, block thresholding wavelet estimators with equispaced selection of points n^, Z = 1, 2, . . . , M, 
produce quite accurate estimates of /(•). 

8 Concluding remarks 

We considered the question of whether and when, in the functional deconvolution setting, it is 
legitimate to replace the real-life discrete deconvolution problem by its continuous idealization. 
In other words, using the asymptotical minimax framework, we studied whether the continuous 
model and the discrete model are equivalent for some or any sampling schemes from the viewpoint 
of convergence rates, over a wider range of Besov balls and for the L^-risk. It is worth mentioning 
that when we talked about convergence rates we referred to the lower bounds which are attainable 
up to, at most, a logarithmic factor according to Theorems [T] and [2| In the cases when convergence 
rates in the discrete model depend on the choice of a sampling scheme, we also explored the optimal 
sampling strategies. The conclusions of our investigation can be summarized as follows. 

If Conditions I, I* and II are satisfied then the convergence rates in the discrete model 
are independent of the number M and the choice of sampling points u and coincide with the 
convergence rates in the continuous model. In this case, which we call uniform, it is legitimate to 
replace discrete model (with any selection of sampling points) by continuous model. 

If Condition II does not hold, then there exist at least two different sampling schemes in 
discrete model which deliver two different sets of convergence rates, and at least one of these 
sampling schemes leads to the convergence rates different from the continuous model. However, if 
Condition I holds, one can point out the sampling scheme which delivers the fastest convergence 
rates, namely, sampling entirely at "the best possible" point u* . We refer to this case as regular 
and explore when, under an arbitrary sampling scheme, convergence rates in the discrete model 
coincide or do not coincide with the convergence rates in the continuous model. The case of 
sampling at u* is studied as a particular case. 

In addition, we consider convergence rates in the discrete model under uniform or pseudo- 
uniform sampling strategies. Indeed, when a discrete model is replaced by its continuous counter- 
part, it is implicitly assumed that sampling is carried out at M equidistant points in the interval 
[a,b]. We formulate conditions when this replacement is legitimate and bring examples when the 
uniform, or a more general pseudo- uniform, sampling may lead to convergence rates which differ 
from the convergence rates in the continuous model and are lower than when sampling is car- 
ried out entirely at the "best possible" point u* . Hence, even in the regular case, one should be 
extremely careful when replacing a discrete model by its continuous counterpart. 

Finally, we study the case when Condition I is violated. We referred to this case as irregular. 
In this case, the convergence rates in the discrete model depend on a sampling strategy and, in 
addition, one cannot design a sampling scheme which delivers the highest convergence rates. Since 
Condition I can be violated in a variety of ways, in the irregular case a general study is very 
complex. For this reason, we study a particular example of the irregular case, namely, functional 
deconvolution with a box-car like blurring function. This important model occurs, in the problem 
of estimation of the speed of a wave on a finite interval (Example 3) as well as, a discrete version 
of it, in signal and image processing (see Section |6]). In the case of a box-car like kernel, sampling 
at any one point is, by far, not the best possible choice and delivers lower convergence rates than 
the continuous model. The best choice for this model is uniform sampling with a large value of 
M = Mn- Indeed, if M = M„ > cqh^^^ for some constant cq > 0, independent of n, and the 
selection points ui, M2, • • • , um, are selected to be equispaced, then, according to Theorem [8) the 
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convergence rates in the discrete model with a box-car hke blurring function coincide with the 
convergence rates in the continuous model and cannot be improved. 

The assumption that M = Mn grows at least at a rate of n^^^ is very natural in the inverse 
mathematical physics problems: in fact, if one samples uniformly in the rectangle [0,1] x [a, 5], 
then Mn ^ \/n. However, this assumption is hardly natural in a signal processing setting where 
M corresponds to a number of physical devices, so even if M = M„ — > oo as n ^ oo, it grows at 
a very slow rate. For this reason, the question remains: if M = Mn ^ oo at a rate slower than 
0(n^/^) (e.g., M = Mn = c^n'" , where < f < 1/3, or M = M„ = C4(lnn)''', where 7 > 0, for some 
constants C3 > and C4 > 0, independent of n), can one select points ui £ [a, b], I = 1,2, ... , M, 
such that the convergence rates in the discrete model coincide with the corresponding convergence 
rates obtained in the continuous model? And, if for some such M = Mn the convergence rates in 
the discrete and the continuous models are not the same, what are the best convergence rates that 
can be attained and the best selection of points ui,U2, ■ ■ ■ , um'^ 

The solution of this question, possibly, rests on very nontrivial results in number theory. 
Recall that De Canditiis & Pensky (2006) showed that, if M is finite, M > 2, one of the u^'s is a 
BA irrational number, and ui, M2, • • • , um is a BA irrational tuple, then (6.7) is valid. The constant 
C(M) in (6.7) depends on the value of M and the choice of the BA irrational tuple. Let us now 
elaborate more on this. Note that the numbers ai, 02, . . . , um is a BA irrational tuple (see, e.g., 
Schmidt (1980, p. 42)), if, for any integers pi,P2, ■ ■ ■ ,Pm and q, there exists constant Bm such 
that 

max {\aiq - pi\,\a2q - P2\, ■■■ ,WMq - Pi\) > BmQ'^^^'^, 

where Bm is a positive constant that depends only on M. Schmidt (1980, p. 43) showed that, for 
a finite value of M, a BA irrational tuple always exist, and proposed an algorithm for constructing 
it. It is easy to note that Sjvf — *■ as M — > 00. The value of Bm affects the value of C{M) in 
(6.7) and, therefore, the convergence rates in the discrete model. 

Unfortunately, we are not aware of any results in number theory on how Bm depends on M, 
and we suspect that relevant results may not have yet been derived. However, a partial answer to 
the above question, showing that Bm > Coexp(— 3MlnM), for some Co > 0, independent of M, 
q and pi,p2, ■ ■ . ,Pmj and the construction of minimax upper bounds for the L^-risk over a wide 
range of Besov balls, covering the case M = M„ = o((lnn)"), where u > 1/2, have been recently 
obtained in Pensky & Sapatinas (2009a). 



9 Appendix: Proofs 

Recall that the symbol C is used for a generic positive constant, independent of n, while the symbol 
K is used for a generic positive constant, independent of m, n, M and ui,U2, . ■ ■ , um, which either 
of them may take different values at different places. 

Proof of Theorem [ij The proof of the lower bounds falls into two parts. First, we consider 
the lower bounds obtained when the worst functions / (i.e., the hardest functions to estimate) are 
represented by only one term in a wavelet expansion (sparse case), and then when the worst 
functions / are uniformly spread over the unit interval T (dense case). 

In the continuous model, one can always choose e„ = 1, so the only difference with Pensky 
& Sapatinas (2009) is an extra logarithmic factor. Since the differences for the discrete model are 
much more significant, we only consider below the proof for the discrete model. 

Sparse case. Let the functions fjk be of the form fjk = Jji^jk and let /o = 0. Note that by 

(3.1), in order fjf^ S Bp^^{A), we need 7j < A2^^^ . Set = c2~^^ , where c is a positive constant 
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such that c < A, and apply the following classical lemma on lower bounds: 

Lemma 5 (Hardle et.al. (1998), Lemma 10.1). Let V be a functional space, and let d{-,-) 

be a distance on V. For f,g £ V, denote by An{f,g) the likelihood ratio An{f,g) = dF (f)/dF (g), 
where dF (h) is the probability distribution of the process when h is true. Let V contains the 
functions /o, /i, . . . , /k such that (a) d{fk, fk') > 5 > Q for A; = 0, 1, . . . , k ^ k' ; (b) H > exp(An) 
for some A„ > 0; (c) lnyl„(/o, fk) = Unk~Vnk; where Vnk O'f^ constants andunk is a random variable 
such that there exists ttq > with Ffi_{unk > 0) > ttq; (d) sup^ Vnk < A„. 

Then, sup^gy P^(/) /) > 6/2j > 7ro/2 for any arbitrary estimator f. 

Let now V = {fjk : < A; < 2^' - l} so that H = 2^. Choose d{f,g) = \\f - g\\, where || • || 
is the L^-norm on the unit interval T. Then, d{fjk,fjk') = Ij = ^- Let Vnk = A„ = jln2 and 
Unk = lnA„(/o,/jfc) + jln2. Now, to apply Lemma [5| we need to show that for some ttq > 0, 
uniformly for all fjk, we have Ff^^{unk > 0) = F f^^[\D. An{fo, fjk) > -jln2) > ttq > 0. 

Note that in the case of discrete model 

N M 

-In An{fo,fjk) = 0.5^^ - ^ji^pjk * 9){ui,ti)f - y^{uuti)] = vjk - ujk, 

i=l 1=1 

where 

N M N M 

^3k = l3 ^^ii'jk* 9){ui,ti)eii, Vjk = 0.5'yj "^"^[{ipjk * 9){ui,ti)f . 

i=l 1=1 i=l 1=1 

Observe that, due to F{eii > 0) = F{eii < 0) = 0.5, we have F{ujk > 0) = 0.5. By properties 
of the discrete Fourier transform and taking into account that in the case of Meyer wavelets, 
lipmjkl < 2~-^/^ (see, e.g., Johnstone et.al. (2004), p. 565), we derive that 



1=1 



where 5„ = (4^)"i n2-i-f'j Y^mac, rf{m,u,M). 
Let 

j — jn be such that Bn < 0.5 j In 2. Then, by applying Lemma [5] and Chebyshev's 
inequality, we obtain 

inf sup E||/„ - ff > inf sup ]l]F{\\fn - f\\ > 7i/2) > 0.25 7|7ro. (9.1) 

fn f<^Bl^{A) fn fdV 4 

Thus, we just need to choose the smallest possible j = jn satisfying Bn < . 5 .7 In 2 ^ ev aluate 
7j = c2~^^ , and to plug it into (9.1 ). By direct calculations, we derive, under condition (3.6), that 

r Ken 2-J(2.-i)j-A^ if ^ ^ 0^ 

rf{m,u,M)<l (9.2) 
mec,- [ Ken 2-j(2i^+^-i)j-^ exp (-a(27r/3)^2J^) , if q > 0. 

Hence, if a = 0, then 2^" = C (n*(lnn*)-(^+i))^^^^''^^'''' and, if a > 0, then 2>- = C(lnn*)V/3. 
Now, to obtain the lower bound, plug 7j = c2^-'"'* into (9.1 ) 



inf sup E||/„ - ff > { 



2s' 2s'(\+l) 

C{n*)2s'+2,^ (Inn*) 2s'+2,. ^ if a = 0, 

(9.3) 

C{lnn*y^, if Q > 0. 
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Dense case. Let rj be the vector with components rjk = H, k = 0,1, . . . ,2^ — 1, denote by 



7i Y.k=o^ Vkil^jk- Let a. 



so rf be the vector with 



E the set of ah possible vectors r], and let fjrf 
components = (— loi i,k = 0,1, ... , 2^ — 1. Note that by (3.1 ), in order fj^j G Bp ,j{A), 
we need 'jj < A2~^^^'^^^'^\ Set 7j = c^,2~^^'^^^^'^\ where c* is a positive constant such that < A, 
and apply the following lemma on lower bounds: 

Lemma 6 (Wilier (2005), Lemma 2). Let An{f,g) be defined as in Lemma^ and let 
rj and fjn be as described above. Suppose that, for some positive constants A and ttq, we have 
¥f.^^{— In An{fjjji, fjri) < A) > vtq, uniformly for all fjrj and all i = 0, . . . ,2^ — 1. Then, for any 
arbitrary estimator f and for some constant L > 0, one /las max^g = Ej^^ ||/ — > LiToe"'^ 2^^'^'yj. 

Since, by Chebychev's inequality, Ff^^{lnAn{fjr,^,fjr,) > -A) > 1 - E/^. J In A„(/jy , /j-fc) |/A, 
we need to show that Ej^.^l In An{fj,^i , fjri)\ < Ai, for a sufficiently small constant Ai > 0. Observe 
that 



N M 



N M 



In Anifjn^Jjr,) = O-^lj^Yl^ia* fj-n' - fjv)i'^i^ti)f - ^j^^^H [id * [fjr,^ - fjv])iui,ti)] 

i=l 1=1 i=l 1=1 

Then, due to \ fj^^ - fjrj)\ = 2\^pjk\, one has E/^,^| lnyl„(/jy, < A„ + Bn, where 

N M N M 

An = 2-fj El'^'^i'tpjk* g){ui,ti)eii , Bn = 2-fj '^'^{xpjk * gf{ui,ti). 



i=l 1=1 



1=1 1=1 



Since, by Jensen's inequality. An < \/2Bn, we only need to construct an upper bound for Bn. Note 
that, similarly to the sparse case, one has Bn = 0{n2~^jj J2m&c '^ii''^^^^-^))- According to 



Lemma [6| we choose j = jn that satisfies the condition Bn + \/2Bn < Ai. Using (9.2), we derive 

that 2^" = C(n*(lnn*)-^)^^^^'^^''^^^ if a = and 2^" = C(lnn*)^/^ if a > 0. Then, Lemma|6| 
and Jensen's inequality yield 



inf sup E||/„ - ff > { 



C{n*) 2S+2U+1 (Inn*) 2s+2>.+i ^ if a = 0, 



^ C{lnn*yT, if a>0. 

Now, to complete the proof one just needs to note that s* = min(s, s'), and that 
2s/(2s + 2i/ + 1) < 2s7(2s* +2i/) if u{2-p)<ps*, 



(9.4) 



(9.5) 



with the equalities taken place simultaneously, and then to choose the highest of the lower bounds 
(9.3) and (9.4). This completes the proof of Theorem [T| □ 

Proof of Lemma [l| In what follows, we shall only construct the proof for bjk since the 
proof for Ojofc is very similar. Again, we construct the proof only for discrete model, since in the 
case of continuous model, one can always choose En = 1, so the only difference with Pensky & 
Sapatinas (2009) is an ex tra logarithmic factor. 

Note that, by (2.7), one has bjk - bjk = Y^mac Cfm - fm)'4'mjk, with 



M 

fm- fm = N'^/"^ ^ gm{ui) Z^l 



M 

E 

1=1 



(9.6) 
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where Zmi are standard Gaussian random variables, independent for different m and /. Therefore, 

M 

Ojk — Ojk\ 



-1 



since \Cj\ = A-k2^ and l^mjfcP ^2 If «; = 2, then 



mjk I 



meCi 



of2-^A^-2M-3A2(j) + iV~2M~2^?(j)) = o(2-^n-^M~^A2U) + n-^A^O')). (9.7) 



Direct calculations show that whe n a = one has A2(j) = 0(2^-''^j'^'*'e~'^). Plugging expressions 
for Ai(j) and A2(j) into formula (9.7) and taking into account that 2^ < 2'^^^ < {n*)^/^'^'^~^^\ one 
derives 

- b,,\' = O [y^^ + =0{n K)5^-^(lnn)3^ + in*)^^-\lnnr 

To complete the proof, observe that in the last expression, the second term is asymptotically 
smaller than the first. □ 

Proof of Lemma [2} Again we carry out the proof only for discrete case. The proof 
for continuous case can be obtain as a minor variation of the proof below. Consider the set of 
vectors f2jr = k £ Ujr ■ Y2k<^Ujr l^'^l^ — centered Gaussian process defined by 

Zjr{v) = J2keUjr '^ki^jk ~ bjk)- The proof of the lemma is based on the following inequality: 

Lemma 7 (Cirelson, Ibragimov & Sudakov (1976)). Let D be a subset o/ M = (— cxd,cxd), 
and let {(,t)teD be a centered Gaussian process. If K{supi^£) ^t) < Bi and sup^g^) Var(^f) < B2, 
then, for all x > 0, we have P( sup^g^) £,t > x + Bij < exp ( — / {2B2) ) . 



To apply Lemma [Tj we need to find Bi and B2. Note that, by Jensen's inequality, we obtain 



E 


sup Zjr{v) 


= E 









ki^Ui, 



1/2 



< 



'jk "jr'fcl 



keUr 



1/2 



< ^ = ni . 



(Here, ci is the same positive constant as in (3.16) with a = 0.) Also, by (|2.2[) and (9.6 



we 



have E[{bjk - bjk){bjk' - bjk')] = n ^ YlmeCj i^mjk-ipmjk'iTiim)] \ where Ti(m) is defined in (|3.5|. 
Hence, 

sup Var(Zj>(?;)) = sup ^ ^ VkVk' ^ tpmjktpmjk'[ri{m)]~'^ 



< ci(n*)-i22-^j^ E ^ ci(n*)-i22-ij^ = B2, 



k&U-j. 



by using X^^gc*^ 4'mjk4'mjk' = I(^ = ^') and (3.16) for a = 0. Therefore, by applying Lemma jrj 
with Bi and defined above and x = Bi ((2-y/ci)^^ fJi^fhi — l) 1 and noting that under condition 
(3.19), ln(n*) > /i2lnn, we derive 



2^ 



2^"^ ln( 



n 



4n* 



< 



exp 



li.Vh2 
2J^i 



' B- 



2B2 



< n 



-3 
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since (3.19) implies that 0.5 \li\fh2l {^\fo\) — 1]^ > 3. This completes the proof of Lemma [2J □ 
Proof of Theorem [2] First, note that in the case of a > 0, we have E||/„ - /f = iii + i?2, 

where 

^1 = E E ^'a., R2=Y. ^(^^ofc - «iofc)'> (9.8) 
j=J A:=0 fc=0 

since jo = J. It is well-known (see, e.g., Johnstone (2002), Lemma 19.1) that if / G ^^^(A), then 
for some positive constant c*, dependent on p, g, s and A only, we have 



2J-1 



(9.9) 



fc=0 



thus, i?i = 0(2-2-^'**) = O ((lnn*)-2s'//3). Also, using ( |3.16[ ) and ( |3.17[ ), we derive = 
O (n-i2^oAi(jo)) = O ((n*)-i/2(lnn*)2'^/^) = o ((In n*)-^^ , thus completing the proof for 
a > 0. 



Now, consider the case of a = 0. Note that by condition (3.10) one has Inn* x Inn. Due to 
the orthonormality of the wavelet basis, we obtain 



E||/„-/||2 = i?i + i?2 + i?3 + i?4, 



.10) 



where R\ and R2 are defined in (9.8), and 
J-i 



R^ = Y. Y. E ^ - ^ok? > ^i' {n*)-'2^''' ln(n*)/) 

j=jo reAj keUjr 

^4 = E E E ^ [^h ^^^^r < /i' (n*)-i22-^ ln(n*)i^)' 



j=jo r&Aj keUjr 



where Bjr and jj are defined by (2.8), (3.13) and (3.19^, respectively. 



Let us now examine each term in (9.10) separately. Similarly to the case of a > 0, we obtain 
Ri = O (2"2Js*^ ^ Q (^^-2s*/{2iy+i)^ _ gy calculations, one can check that 2s*/{2iy + 1) > 

2s/(2s + 2z^ + 1), if u{2 - p) < ps*, and 2s*/(2z^ + 1) > 2s*/(2s* + 2i/), if iy{2 - p) > ps*. Hence, 

O ^(n*)~2<i+2l'+i^ , if i'(2 -p)<ps*, 
0((n*)"2i^), if i^(2-p) >ps*. 



-Ri = < 



11) 



Also, by (3.17) and (3.16), we obtain 



R2 = ((n*)-i2(2'^+i)j«) = o (^(n*)^ra+T) = o ( {n*)'^^ 



).12) 



Denote 



e 



ijj : 



fefc - h^k? > 0.25^2 („*)-i22-. ln(n*)j^ 
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To construct the upper bounds for and i?4, note that simple algebra yields Rs < (-R31 + 
R32), R4 < (-R41 + R42), where 



J-i 



J-i 



' ^41 = E E E ^ , 

j=jo r£Aj keUjr 



= E E E [fe-^ - ^^■^)' ^(©^ 

J-i 

^32 = E E E ^ [fe-^ - V)' KB,r > 0.25^2 („*)-i22-^ ln(n*)i^) 
J-1 

= E E E ^ ^(^^-^ < 2-5/^' K)-i22-nn(n*)j^) 



Then, by (9.9), Lemmas [T] and [2] and the Cauchy-Schwartz inequality, we derive 



J-i 



i?3i+i?4i = o| E 

^i=jo reAj fce;7jr 



O I ^[V^(lnn)3V2(„*)-2{2l+T) +2-2^'^*] n-t | = 0((n*)-i), 



J-i 



provided ji satisfies (3.19). Hence, 



Now, consider 



A2 = RZ2 + Ri2- 

First, let us study the dense case, i.e., when v{2 — p) < ps*. Let ji be such that 

1 (2/p-l)+-A 

2-^1 = (n*) 2^+2"+! (Inn) 2^+2^+1 . 



).13) 
).14) 

).15) 



Then, A2 can be partitioned as A2 = A21 + ^22 j where the first component is calculated over the 
set of indices j'o < j < Ji and the second component over ji + 1 < j < J — 1. Hence, using (2.8) 
and Lemma [ij and taking into account that the cardinality of Aj is \Aj\ = 2-' /Inn, we obtain 



^21 = o E 



h V2{2u+i)jj\ ^ ^ 22^Jln(n*)j^ 



\'J=30 



O 



(In n 



n"^ 



(Inn)f , (9.16) 



where q is defined in (3.14). To obtain an expression for Z\22) note that for p > 2, by (3.10) and 
(9.9), we have 



O ( (n*) 2i,+2t+i (Inn) 2;>+2i+i 



If 1 < p < 2, then Bf^ = (EfceC/,, ^ifc )" ^ EfceC/,. I^ifcl^' so that by Lemma 



p/2 



(9.17) 



and since 
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1^(2 — p) < ps* , we obtain 

\j=ji+lr&Aj 
J-1 



n 



(9.18) 



0\ E (K)-i22-i/lnn)'-P/'2- 

U'=jl + 1 



+1 



Let us now study the sparse case when v{2 — p) > ps* . Let ji be defined by 2-^^ = 
(n*)2s+L+i(lnn) -i.+l+i _ Hence, if 5j> > 0.25^^ [n*)-^2'^''nn{n*)j^ , then < ^ 



g*2-2i** (see (9.9)) implies that j < j2 where j2 is such that 2^'^ = C[n*/(lnn)i+^]i/(2s*+2i.)^ where 



C depends on // and c* only. Again, partition A2 = A21 + A22, where the first component is 
calculated over jo < j < j2 and the second component over j2 + 1 < J < J —1. Then, using similar 



arguments to that in (9.18), and taking into account that u{2 — p) > ps*, we derive 

l-p/2 



Z\2i = O [ E [{n*)-^2'^''^j^lnn 
\j=jo 



O I E [{n*)~'^2'^''^ 



'j'^ In n 



E E i^.-i^ 

re A,- fcec/jv 



).19) 



To obtain an upper bound for /A22, recall (9.14) and keep in mind that the portion of Rs2 corre- 



sponding to j2 + 1 < j < J — 1 is just zero. Hence, by (9.9), we obtain 

^22 = I E E^'^) = E = O ([{n*)-Hlnnr'] 

\j=J2 + l k=0 I \j=i2 + l / ^ 



2a*+2i^ 



Now, in order to complete the proof, we just need to study the case when j/(2 — p) = ps* . 
In this situation, we have 2s/(2s + 2iy + 1) = 2s* /{2s* + 2i/) = 1 - p/2 and 2uj{l - p/2) = pjs*. 

Recahing (3.1) and noting that s* < s' , we obtain J2jZio i'^^^'* TX=o \^3kY'Y'^ < Then, we 



repeat the calculations in (9.19) for all indices jo < j < J — ^- If 1 < p < g, then, by Holder's 



J-i 



2^-1 



q/p- 



E 2^^'* E 1^.^ 



3=30 



k=0 



inequality, we obtain 

A2 = ((n*)-^(lnn)^+^y~^^^ (Inn)^-^/^ 

= 0^((n*)~^(lnn)^+^)^^ (lnn)i-P/«J. 

If 1 < 9 < then, by the inclusion ^^ ^(^4) C Bpp{A), we obtain 

/ -^"1 1- /2 \ / 

A2 = 0i E ((lnn)i+V^*) ^ ^ Ibjkl" = O ((In n)^+Vn*) 



p/i\ 



.20) 



2s* 
2s*+2i^ 



.211 



■-3=30 



k=0 



By combining ( |9.1lD -( [9l2| ), ( |9.13D , ( |9.16D -( [9l2T| ), we complete the proof of Theorem [2) □ 
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Proof of Theorem [Sj The first part of the theorem is identical to Proposition 1 of Pensky and 
Sapatinas (2009). The second part can be proved by contradiction. Assume that, assumptions 



(4.2) and ( |4.3[ ) hold but condition (4.4) does not take place. It follows from (4.2) and (4.3) that 

\gmM\^ < Klm]-^"^ exp{-ai\mf^), ui > if ai = 0, 
\gmiu*)\'^ >K\m\~'^''''exp{-a2\mf^), U2 > if 02 = 0, 



Observe that condition (4.4) of Theorem |3] can be violated only in one of the following ways: 
«! = 02 = but 1/2 < z^i, or Qi > but 02 = 0, or ai > and 02 > but P2 < Pi- 
Applying Theorem [T] with Af = 1, e„ = 1 and ui = u*, we arrive at, as n — > (X), 



Ri{B;^^{A),u,,l)>< 



Cn 2s+2.i+i^ if ai = f), ui{2-p) <ps* 



C 



Inn \ 2s*+2,.i 



2£_ 



^ C(lnn) 

On the other hand, applying Theorem [2] with M 



, if a\ = 0, v\{2 -p) > ps*, 
if ai > 0. 

= !,£„ = ! and ui = u* , we derive that, as 



n 



00, 



sup E||/^-/||2< <; 



Cn 2s+2,.2+i (Inn)^, if 02 = 0, 1^2(2 - p) < ps*, 

2a* 

(lnn)^ if a2 = 0, M2-p)>ps*, 



C 



^ C{lnn) 
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if 02 > 0, 



where p is given by formula (|3.14|) with v 



V2- Now, to complete the proof just note that if 
or aia2 > but P2 < Pi, then the asymptotical 
are higher than the corresponding upper 



02 = but 1^2 < z^i or ai > but 02 
minimax lower bounds for the L^-risk at the point u 
bounds at the point u = u*. Hence, in this case, the convergence rates cannot be independent of 
the choice of M and the selection of points u, arriving at the required contradiction. □ 



Proof of Theorem |4]. Note that the first inequality in formula (5.1) as well as relations ( |5.2[) 
and J5.3| ) between upper bounds in discrete and continuous cases follow directly from Theorems [T] 
and hi and from inequalities Tf{m,u*,l) > KTf{m) and Tf{m,u*,l) > KTf{m,u,M). Hence, one 



only needs to prove second asymptotic relation in formula (5.1). 

Let RiiB' JA),u, M) be the minimax L^-risk for fixed values of u and M, defined by formula 



(3.3), and let 



H{u, M,j) = 2-^ 7| ^fi^^ ^) ■ 



From the proof of Theorem 1 of Pensky & Sapatinas (2009), it follows that, in the sparse case 
(when 1/(2 - p) > ps*), one has Rn{B^^^{A),u, M) > 02'"^^"^"*, as n ^ 00, where jn = jn{u,M) 
is such that nH{u,M,jn)/jn = C. Similarly, in the dense case (when 1/(2 — p) < ps*), one has 
Ri{B^p^q{A),u,M) > C2-2ins^ as n ^ 00, where j„ = jn{u,M) is such that nH{u,M,jn) = C. 

Consider now two different values of M, say Mi and M2, and the corresponding sets of u's, 
say andM2- Tf{m,Ui, Mi) < Tf{m,U2,M2) for any m E Cj, then H{ui, Mi, j) < H{u2, M2, j). 
Observe that, for fixed M and u, both H{u,M,j) and H{u,M,j)/j are decreasing functions of j. 
Hence, if jni = jn{ui, Mi) and jn2 = jn{u2, -^2) are the values of jn corresponding to {ui. Mi) and 
(n2, M2), respectively, then j„i < j„2- To show that this is true in the dense case, observe that the 
opposite, jni > jn2, implies Cn~^ = H{ui,Mi,jni) < H{ui,Mi,jn2) < H{u2, M2, jn2), so that 
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jn2 cannot be the solution of equation H{u2, M2, jn2) = Cn~^ and jni > jn2 cannot be true. In 
the sparse case, one just needs to replace H{u, M,j) by H{u, M,j)/j. 

Now, it follows immediately that in both sparse and dense cases, Rfi^{Bp^^{A),Ui, Mi) > 
i?^(-Bp q(A), -^2)- Therefore, the infimum of Rf^{Bp^^{A),u, M) is attained at M and u such 
that Ti{m,u, M) = sup„ ^ r{'(m, u, M). Since, for any choice of M and any selection of points 
u, one has rf(m,u, M) < KTf{m;u*,l), the validity of the theorem follows from Theorem 1 in 
Pensky & Sapatinas (2009). □ 

Recall that rf(?n,) = J^^ \gmiu)\'^ du and Tf{m,u*,l) = \gm{u*)\'^. 
, a(-) and /?(•) are continuous functions on the interval U = [a,b], then 
there exist vi < U2, a\ < 02 and /3i < (32 such that ui < u{u) < 1^2, oi < a{u) < 02 and 
/5i < P{u) < (32, u £ [a, b]. Moreover, in the inequalities above, either qi = 0:2 = and vi > 0, or 
ai > and (3i > 0. Consider the cases when a) a{u) = and b) a{u) > 0, (3{u) ^ const. 

Case 1: a{u) = 0. Then |c/m(ii)P |m|-2^("), so that \gm{u)\'^ < K\gm{u*)\'^ a nd x 
|^|-2i/(n Hence, in the discrete case, the asymptotical minimax lower bounds in (5.17) and the 
asymptotical minimax upper bounds in (5.18), for the L^-risk, follow directly from Theorems [l] 
and[2| respectively. 

In order to complete the proof, we need to obtain the asymptotical minimax lower and upper 
bounds for the L^-risk in the continuous model. For this purpose, observe that, under conditions 



Proof of Lemma ( 

Observe that since v 



(5.8) and (5.10), one has (see, e.g.. Bender & Orzag (1978), pp. 266-267) 



rf (m) X / exp(-2|lnm|z^(n)) dn X |m|-2'^("*)(ln|m|)-i/^. 



(9.22) 



so that Theorems [T] and [2] yield, respectively, the asymptotical minimax lower bounds in (5.11) 
and the asymptotical minimax upper bounds in (5.12), for the L^-risk. 



Case 2: a{u) > and P{u) / const. In this case, /?(«*) = (3i. Therefore, one derives 
K\m\~'^'^^ exp{—a2\Tn\^^) < \gm{u*)\'^ < K\m\~'^'^^ exp{—ai\m\^^) Hence, the asymptotical mini- 



max lower bounds in (5.17) and the asymptotical minimax upper bounds in (5.18), for the L -risk, 



follow directly from Theorems [T] and [2| respectively. 

To obtain the asymptotical minimax lower and upper bounds for the L^-risk in the continuous 
model, note that 



On the other hand, 



exp 



-a(n)|m|'^(")) du < C^ib - a)\m\-^''^ exp(-Qi|m|^i). 



rf(m) >K|m|-2^2 ^ exp (^-Q(n)|m|^(")) du. 



).23) 



.24) 



Since (3{-) is a continuously differentiable function in some neighborhood of u* , \u—u*\ < d, we have 
P{u) < P{u*) + (3^\u — u*\, where (3* = max|„„„,|<^ |/3'(n)|. Therefore, using the inequality < 
l + 3z for < z < 1, we obtain |m|'^(") < |m|'^i exp {P^\u — n*| In |m|) < | m| (1-|- 3/3*1 it — In |m|) 
for |u - < In |m|/(3/3*). Denote n^m{u*) = {u e U : \u-u*\< |m|-(^i+i)} . Then, 



exp I -a(u)|m|^(")) > e-^^l'"!"! / exp (-a2\mf^3(3^\u-u*\ In \m\) du 



exp(— a2|?n.|'^^^^ 



(9.25) 
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since 3/5=Ka2|"i| ln|m| < 1 for \m\ large enough. Combining (9.23)- (9.25), we derive that 



Ke-^ |mr(2''2+/3i+i) < rf(m) < K{b - a)\m\-'^''^ exp (^-ai 



m 



I ft 



).26) 



so that Theorems [T] and [2] yield, respectively, the asymptotical minimax lower bounds in ( 5.11[ ) 
and the asymptotical minimax upper bounds in (5.12), for the L^-risk. □ 



Proof of Theorem ^ First consider the case when a{u) = 0. From (5.8), it follows that 
Tf{m,u,Mn) < ir|m| \ so that e„ < i^(ln |m| Since, in this case, ln|m| x Inn and 
In |m| > 1 as n ^ cxd, one has e„ = 0((ln n)^^) , where A3 = max(Ai,0). The latter, in combi- 
nation with (5.14), implies that condition (3.10) holds and, moreover, that C njln n)-^ ^ < n* < 
Cn(lnn)'^^. Then, Theorems [l] and [2] imply that under conditions (5.8), (5.13) and (5.14), one has 
Ri{BI,q{A),u,M) > R^iB^giA)), where Ri{B^,j{A)) is given by expression ( |5.17 ) and that, as 
n — > 00, 



sup E||/„^-/f< 



C (n^Hlnn)^i+^2) 2.+2.(.*)+i (inn)^, if iy{u*){2 - p) < ps* , 
C7U-i(lnn)i+^i+^2j ' (lnn)^ if q(u) = 0, K?^*)(2 - p) > ps*, 
where p is defined in (3.14). If, moreover, ( 5.15[ ) holds, then Theorem [T] yields, as n — > 00, 

C (n-i(lnn)^i+^2)-2.+2,'(t.*)+i ^ if v{u*){2 - p) < ps* , 

2s* 

c(n-i(lnn)i+^i+^2)'^*+"^^"*\ if iy{u*)i2 - p) > ps* . 



Ri{Blg{A),u,M)> 



To complete the proof of this part, compare the above upper and lower bounds with (5.11) and 
(5121). 



Now, let a{u) > 0. Then, due to assumption (3.10) one has Inn* x Inn. Under condition 

(g, by Theorem [l| Ri{Blg{A) , u, M) > R^{B^ g{A)) > C(lnn)-2^*//3(«*), as n ^ 00. Also, 

by Theorem jij supjg^s ^(^) E||/^ - /p < C(lnn)~^''*/^("*\ as n ^ 00. To complete the proof, 

compare the above lower and upper bounds for the L^-risks with the corresponding bounds in 
( STTTj ) and ( [5l2| ). □ 

Proof of Theorem |6| Note that conditions ( |5.19 ), (5.22) and Theorem [T] imply that, as n — > 00, 



C(n*) 2s+2^+i (lnn*)2s+2,.+i ^ if a = 0, v{2 - p) < ps* , 



<(S^,,(^),n,M)><j c(i^)"*+'''(lnn*)ra, if a = v{2 - p) > ps* , 

C(lnn*)"^, if a > 0. 



).27) 



Denote the ratio between the upper bound for the L^-risk (5.18) in the continuous model and 
the lower bound (9.27) by A„ = sup^g^s ^(^) E||/^* - ff j Ri{B'pq{A),u,M) and observe that 
the convergence rates m the discrete model are inferior to the convergence rates in the continuous 
model if lim„_oo(lnn)''''("(")=°) A„ = for any /i > 0. 

Let a{u) = and consider the case when z/(2 — p) < ps* . Then, taking into account that 
under condition (3.10) one has Inn* x Inn, we obtain 

lim A„, (Inn)'' = O ( lim n~ 2i+2^^^?T+i (Inn)^^'' (n*)2s+2t+i (Inn*) "2^+2^+1 



O I lim 

, n— >oo 



(Inn) 



p+h- 



2a+2i^+l 12 \2s+2ij(u*) + l 2s+2 



Ih:t(1+^o)) 
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Now, if u > i^iu*), then it is easy to see that under condition (5.20) we have hm„^oo A„(lnn)'' = 
for any h, and the convergence rates in the discrete model are inferior in this case. If u = z^fu*), 

"(lnn)''+'^ 



then hm„^oo (In n) 



O ( hm„ 



if condition (5.211 



"28+21^+1 (£^)2s+2i/+l 

holds. The sparse case when i/(2 — p) < ps* can be treated in a similar manner. 

_ 2s* 2s* 

Now, consider the case when a(n) > 0. One has lim^^oo = linin-+oo(lii n) ''("*)(lnn*) 



-2s* 



(1 + g p) " lim„_^oo(lnn 
(|5:23|),lim„^ooA„ = 0. 

Proof of Corollary [l| Note that if M = M„ is finite, then for a{u) = one has Tf{m, u, Mn 



/3(u*) 13) ^ g^j^^ jg gasy to see that under each set of conditions in 

□ 



-2v 



where v = mm.{i'(ui),v{u2), ■ ■ ■ ,v{um))- If a{u) > 0, then denote Iq = argmin^ /3(?x;), 
P = f3{uig), vq = v{uiq) and ao = (^{ui^). In this case, Tf{m,u,Mn) ^ |m|~^'^°exp (— ao|?Ti|^) and, 
hence, the validity of the corollary follows from Theorems [5] and [6} □ 

Proof of Corollary [2]. Note that T|^(m,?x, M„) > K(lnn)~^* \m\~'^'^^'^*^ and, hence, the validity 
of the corollary follows Theorem |5j □ 

Proof of Corollary [sj Note that, for ui such that = P{u*) one has Tf{m,u,Mn) > 

exp (— a(ii/)|m|^*^"*-') . Then, the validity of the corollary follows from Theorem pi 



K n 

□ 



-2v(ui) 



Proof of Theorem [T]. 

First, consider the case when a{u) = 0. Denote v{x) = v[S{x))^ x* = q{u*) and let 1* be 
the index of a point closest to x*, i.e., I* = argmin \x* — {I — \ + d)/M\. Note that v{x*) = v{u*) 
and the function v{x) is continuously differentiable with < vq f or some vq > 0. Note that 

if we show that under condition (^5.25^ we have 



T^{m,u,Mn) > K|m|-2''("*)(lnn)^ 



).28) 



for some constant A G M, then the validity of the theorem will follow from Theorem [Sj In order to 
prove (9.28), note that 



Tf{m,u, M) 



M 



M 

1=1 



M 



1=1 



> 



M 



\m\ 



M 
1=1 



2"ol'-'*l "5 K , 
M M > — \m 



M/2-1 

k=0 



M M 



where Vq = VQi{x* ^ {I* — 1 + d)/M). Now, recall the following statement from Calculus: if u{z)^ 
z > 0, is a continuous, positive, monotonically decreasing function, then 



W2-1 / M/2 

> uik) > max / 
to Vo 



u(x)dx, n(0) + 



M/2 \ ^ / 

u{x)dx > - u{0) + 



M/2 



Applying (9.29) with u{x) = |m 
— 

\^\-2u(u*)-jj- j^-i -^g obtain 



1^1 



u{x)dx . (9.29) 



, and taking into account that J^^^'^ u{x)dx 



T^{m,u,M) > (ln|m|)"^ (l + M"^ ln|m|) x exp {-Vq M''^ ln\m\) . 
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Now, recall that ln|m| x Inn in this case and note that under the first assumption in (5.25), 
Ti{m,u, M) satisfies condition (5.13) of Theorem [s] with Ai = —1 and = 1- Hence, the conver- 
gence rates in the discrete and the continuous models almost coincide. 

Now, consider the case when a{u) > 0. Denote v{x) = P{S{x)) and let, as before, x* = q{u*) 
and /* = argmin|x* — (/ — 1 + d)/M\. Note that v{x*) = f3{u*) and that the function v{x) 
is continuously differentiable with < vq for some constant vq > 0. Denote, as before. 



volix* / (r - 1 + d)/M). Note that r{'(m, u, M) > M-^K \m\-'^''^ Y.i=i exp 



-ai m 



|/3{«i)) ^ 



where i^i = maxz^(u), ai = maxa(ii), u G U, and, in order to prove the statement, we need to 
construct a lower bound for S{m, M) = exp (— Q:i|m|^^'^'^) . Similarly to the polynomial case. 



we obtain that S{m,M) > K X]fc=o ^ exp { —ai\m 



PI" )^ M ^ 2M 



. Denote a 



m = ai\m 



and apply the inequality (9.29) with u{x) = exp 



VQ k 

-am \m\ ~M" 



Observe that 



M/2 



u{x)dx 



M 







z;Qln|m| Ji 



|m|"o/2 



z ^ exp{—amz)dz > 



M 



2vq In \m\ar, 



exp(-am), 



and recall that In \m\ x Inlnn. Hence, under the second of conditions in (5.25), as M — > oo and 
\m\ — > oo, we derive that 

S{m,M) > KM(lnn)~i exp (^-ai|m|'^("*) exp[0.5voM-^ In |m|]) . 



Note that due to assumption ( |5.25 ) and due to ln|m| >i Inlnn, there exists rs > such that 

In \m\ < r3 when n is large enough. Therefore, Tf{m,u, M) > C (lnn)~-^ |77),|-(2i^i+/3(M*)+i) 
exp (— ai exp(0.5 VqT^) |m|^("*)) . Application of Theorem [s] with v = ui + 0.5 13{u*) + 0.5, a = 
ai exp(0.5 VqT^) and e„ = (In n)~^ completes the proof of this part of the statement. 

To prove the last statement in Theorem [7] recall a simple fact from Calculus: if function 
F(x), X e [0, 1], is continuously differentiable with Fq = max^^ |F'(x)|, then for any d such that 
< d < 1 one has 



M 1 

V F ((Z - 1 + d)/M) - / F(x)d2 

Jo 



< 0.5 M"^ Fq. 



).30) 



Let a{u) = 0. Note that since !(?'(«) | is bounded and separated from zero, one has Jq \gm{S{x))\'^dx 
Ja \9miu)\'^q'{u)du X r{^(m). Therefore, if 



R{m,n) =Tf{m,u,Mn) - / \gm{S{x))\^dx = o{T^{m)) , 

Jo 



).311 



as n ^ oo, then Tf{m,u,Mn) Tf(m) and the theorem is proved. Applying formula (9.30) to 
F{x) = \gm{S{x))\'^ and noting that |5"(x)| < S2, we obtain 



ii(m, n) < 0.5 S2 -M„ ^ max -j-\gmiu)\ 



u(l^[a,b] 



du 



O ( M-^ \m\ 



m\ 



Comparing the last expression with rf(m) given by formula (9.22), we confirm that condition 
(9.31 ) holds and theorem is valid in this case. □ 



Proof of Lemma [4]. Recall that 0<a<6<c>o, /?i< (3{u) < (32, u G [a,b], for some 
< /3i < /32 < oo, and ui = a + {b — a)l/M, 1 = 1,2,..., M. Consider first the case when a G N. 
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Then, A-K'^m'^M Tf{m,u,M) = Y^fL^ (3'^ {u) sm^ {2-Kmui) > X^fi sin^ (27rm(6 - a)//M) . Using 
the formula 1.351.1 of Gradshtein & Ryzhik (1980) with x = 2iim{b — a)/M, n = M and k = I, 
we obtain 

df ^ /?2 fM cos(2(M + l)7rm(6 - a)/M) sin(27rm(b - a)) \ 
"1 ^' ^ 4^MI [Y 2sin(2vrm(6-a)/M) ) ' ('"''^ 

Since 2-^ < \m\ < 2-k j'iV and Inn < 2^ < n^^^, the condition M„ > (327r/3)(6 — a)n^/^ guarantees 
that |27rm(6 — a)/M| < tt/2. In this case, using the inequahty y < 2sin(y), < y < 7r/2 (see, e.g., 
Lang (1966), p. 41), we derive 

2sin(27rm(5-a)/M) > 2nm{b-a)/M > (47r2(6 - a) In n)/(3M). (9.33) 



Hence, combining (9.32) and (9.33), for n large enough, we arrive at T^{'m,u, M) > Km 



Consider now the case when a N. A standard trigonometrical identity yields 

^sin2(27rma + 27rm(6 - a)//M) = - (^M - ^ cos(47rma + 47rm(6 - a)//M^ . (9.34) 
(=1 1=1 

Using the formula 1.341.3 of Gradshtein & Ryzhik (1980) with x = 47rma, y = 47rm(6 — a)/M, 
n = M and k = I, we derive for M > 4(6 — a)|m|, 

E^' , ,, cos(47rma + 27rm(6 - a)(M - 1)/M) sin(27rm(6 - a)) 
cos(47rma + 47rm(6 - a)l M) = ^ — (^—^ ^ ^ ^ 
V 7/ 7 sin(27rm(6-a)/M) 

M 

< 9.35 

Trm{o — a) 



Hence, combining (9.34) and ( 9.35| ) in a manner similar to the first part of the proof, for |m| large 



enough, we arrive at rf'(m, u, M) > Km ^ which completes the proof. □ 

Proof of Theorem [H 

The proof follows directly from the discussion of Section [6] by combining Theorem [T] Theo- 
rem [2] and Lemma |4| taking Aj = Cj = {m : ipmjk 7^ 0}, and noting that, for the Meyer wavelets, 
Cj C 27r/3[-2J+2, -2i] U [2-'',2^+2] with \Cj\ = 47r2J' (see, Johnstone et.al. (2004, p. 565). □ 
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